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DESCRIPTION 

METHOD OF DETECTIONQFNUCLEIC ACIDS 
WITH A SPECMC SEQUENCE COMPOSITION 

Background of the Invaition 

L Field of the Invcntioii 

This invexmon provide a method and compositions for use in binding, ^tecting, and 
amplifying the cfetection of specific Target Nucleic Acid sequences in a sample with fidelity and 
accuracy, even in the presence of closely lelated but difiHeot nucleic acids. The binding may involve 
the chaperoning and assembly of specific molecules into Target Binding Assemblies which 
specifically bind Target Binding Regions fonned by the hybridization of Probe Nucleic Acids and 
Target Nucleic Add sequences. The amplifying may involve the chaperoning and/or assembly of 
specific molecules into Booster Binding Assemblies which specifically bind Booster Binding 
Regions formed by the hybridization of Booster Nucleic Adds with Probe Nucleic Acids, Target 
Nucleic Acids, or other Booster Nucleic Adds. A method; aad con^jositions, involving Hairpin 
Nucleic Acids is also provided to enable control of the size of specifically or non-spccifically 
elongated Booster Nucleic Acids and Booster Binding Assemblies used in the amplification. The 
detecting involves providing one or more detectable labels, including radioactive, light- or 
fltKsresccnt-cmitting, Ki^naiatic, or other detectable or signal-generating molecules, in association 
W&. the Probe Nucleic Add, the Target Binding Assembly, the Booster Nucleic Acid, the Booster 
Binding Assembly, or the Hairpin Nucldc Add A method is presented for isolating nucleic acid 
fragments fi^om an organism \^ch has T6A component binding sites in order to create a probe 
nucleic acid and a TBA which is unique for that firagment and/or organism. Therapeutic and 
prq)hylactic uses of die Taig^ Binding Assemblies and con^}ositions for such use are also provided. 



2. Background and Description of Related Art , 

There are an increasing number of cases in ^ch it is important to be able to detect nucleic 
adds containing a specific sequence, hereinafter named Target Nucleic Acids (TNAs), in a sample. 
It is desirable to be able to detect the TNAs with the smaUest number of processing steps, with the 
simplest components and to the exdusion of other similar but diEfemt nucleic acids, hereinafter 
named Cousin Nucleic Acids (CNAs). It is desirable to be able to detect specific TNAs to the 
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exclusion of any and all CNAs in the detection sanq)le without the necessity of amplification or other 
post-detection processing. 

Une are numerous methods which use immobilized or tagged nucleic acids as probes for 
TNAs. However, using known inethods^ is difficult to discrirninatebetwem 

5 Pn>beNucidcMd(PNA)asq5posedtoaCNAboundtothePNA. Forexan^le,oneormOTebase 
mismatches between the PNA and a CNA can stiU result in a CNA*PNA hybridization which is 
ahnost indistinguishable from a TNA-PNA hybridization. Thus, I^bridization alone is not an 
optimal indicator that a PNA has hybridized to a unique TNA. 

There are many situations in vdrich a PNA would be used to try to determine whether a TNA 

10 was present in a sample which may contain CNAs. Hybridization of the PNA to any CNA in this 
situation would limit the diagnostic value that the PNA might have for the detection of a TNA, 
absent additional verification. Furthermore, it is desirable to be able to detect and locali^ TNAs 
with low copy numbers in san^les which may contain mai^ copies of CNAs, without the necessity 
of creating additional copies of the TNA. It would also be desirable to be able to confirm the 

15 presence of CNAs, indqjendent of the TNAs, without the necessity of sQ)arating the CNAs and 
TNAs in the sample. 

FurthcmaOT^ it would be d^irable to be able to amplify the signal of even a low frequency 
hybridization of a particular TNA-PNA. For this purpose, a method of polymerizii^ multiple copies 
of a label, hereinaft^ referred to as a Booster Nucleic Acid (BNA) onto the TNA-PNA would be 
20 desirable. 

The instant invention provides methods and compositions for achieving the foregoing 
desired objectives. As revealed by the following review, the instant compositions and methods have 
not beairepOTted or suggested in the art A general and comprehensive review of die state of art of 
nucleic acid &te<^on is provided in I^Iler, H., M.M. Manak (1989) DNA Probes^ Stockton Press. 

25 A metkxi has been rq>Qrted for d^ecdng base pair mismatches by chemical means in order 

to determine whether a PNA has hybridized to a CNA rather than to a TNA. hi U.S. Patent No. 
4,794,075 to Ford et aL , a method for distinguishing fragments of DNA which contain single base 
mismatches from ti^ir perfectly paired homologs is discussed. Single stranded rcgions within a 
duplex firagment are modified with carbodiimide, which reacts with unpaired guanine (G) and 

30 thymine (T) residues in DNA. Linear duplex DNA molecules do not react, while DNA molecules 

with single base mismatches react quantitatively. Following reaction vnfh carbodiimide, the DNA 
molecules are fi^onated on hi^ percentage pofyaciylamide gels such that modified and 
unmodified fragnaents can be distinguished. Ford a/. appUed this techmque in order to locate and 
purify DNA sequence differences responsible for phenotype variation and inherited disease. 
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Although this method is usefiil for following variations in genetic material, it has a large number of 
steps, it requires costly components, and it does not off©* a direct means of detomining whether a 
PNA has Itybridized to the TNA exclusive of CNAs in the sampk 

Tha^ have beai some atten^ts to assure that at least a portion of the hybridiza^ 
5 the PNA and another nucleic add is complementary. One method involves the monitoring of 
transaiption products which are produced if the PNA hybridizes to a nucleic acid sufficiently to be 
transcribed JSrom a promoter site contained in the probe. U.S. Patent No. 5^15,899 to Dattagupta 
discloses how spedfic nucleic add sequences are an5}lified through the use of a hairpin probe which, 
iqxm faybridizatioa with and ttgation to a target sequence, is capable of b The probe 

1 0 comprises a single stranded sdf-ccnqjkmentaiy sequence which, under hybridizing conditions, fonns 
a hairpin structure having a functional promoter region, and further con5)rises a single stranded 
probe sequence e?£temfiijg fim the 3' end of the hairpin sequence. Upon hybridization with a target 
sequence conq)lementary to the probe sequaice and ligation of the 3' aid of the lybridized target 
sequence to the 5 ' end of the hairpin probe, the target sequence is rendered transcribable in the 

15 presence of a suitable RNA polymerase and appropriate ribonuclec^de triphosphates (rNTPs), 
Amplificatiosi is accon:^)lished by iyhridizing the desired TNA sequence with the probe, ligating the 
TNA to the PNA, adding tte RNA polym^se and the rNTPs to the separated hybrids, and allowing 
traiiscription to pn)ceed until a desired amount ofRNAtraiiscription product 1^ That 
metlKKi generally and spedfically involves the use of hairpin DN A formed with a single stranded 

20 unpaired end to anneal a target sequence. When the target sequence is bound, the production of 
RNA transcription products is enabled Thus, the method involves the detection of secondary 
transcription products rather than the use of a nucleic acid binding assembly to directly immobilize 
and/OT localize a target sequence, A CNA could easily bind to the probe, and the lack of 
complementarity would not necessarily interfere with the formation of a CNA-PN A h>i>rid vrfiich 

25 could then support the produ<^on of unwanted transcription products. 

A CNA bound to d© PNA might be detected if the lack of complmentarily interferes with 
the susceptibility of the hyimd CNA-PNA pair to be cut by a restriction endonuciease. In US. 
Pa^No. 5,1 18,605 to IWea and US. Patent No. 4,775,619 to Urdea, novel methods for assaying 
a nucleic acid analyte were provided, which employ polynucleotides having ohgonudeotide 

30 sequences substantialfy honK>logous to a sequence of interest in the analyte, where the presaice or 
^ absence of hybridization at a predetermined stringency provides for the release of a label from a 
stpxMl VaricKis techniques are employed for binding a label to a support, whereupon cleavage of 
either a single or double strand, a label msy be released fi^m a siq)port, and the release of the label 
can be detected as indicative of the presence of a particular polynucleotide sequence in a sample. 
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However, this technique has the shortooniing that a CNA-PNA pair could be cut by the restriction 
endonuclease, evoi if tterc is a mkmatch, so long as the mismatch was outside of tte endonuclcase 
recognition region. This would ie^ to feilure of the assay to identify a CNA-PNA hybrid 

Another method uses a branched DNA probe to detect nucleic acids. U.S. Patent No. 
5,124^46 to Utdea et aL discloses linear car branched oligonucleotute multimers usefiil as an:Q)lifiers 
in biochemical assays which comprise (1) at least one first single-stranded oligonucleotide unit 
(PNA) that is ccwnplemcntary to a single-stranded oligonucleotide sequence of interest (TNA), and 
(2) a mult5)licity of second singje^stranded, oligonucleotide units that are complementary to a single- 
stranded labeled oHgonucieotide. Although amplified sandwich nucleic acid hybridizations and 
immunoassays using the multimers are described, the method has the limitation that PNA-CNA 
hybridization could occur and would result in production of unwanted signal. 

In addition to methods for identification of TNAs, methods have been disclosed for the 
pmpitfi^^tinn of this DNA, lu US* Patent No. 5^00^ 14 to Urdea, an analyte polynucleotide strand 
having an analyte sequent (TNA) is detected widiin a sample containing pdynucleotid^ by 
contacting the analyte polynucleotide with a capture probe (PNA) under hybridizing conditions, 
where the capture probe has a first binding partner specific for the TNA, and a second binding 
sequaice specific for a solid phase third binding partner. The resulting duplex is then immobilized 
fay specific binding between lite binding partners, and non-bound polynucleotides are separ^ed from 
the bound species. The anafyte polynucleotide is optionally displaced from tte solid phase, thai 
mnphSsd by PGR, The PGR primers each have a pofynucleotide region cj^able of hybridizmg to 
a region of the analyte polynucleotide, and at least one of the primers fiuther has aa additional 
binding partner capable of binding a solid-phase binding partner. The an5)lified product is then 
separated from the reaction mixture by specific binding between tie binding partners, and the 
amplified product is d^ted, AUhough it is possible to confirm (by PGR) that a particular nucleic 
acid has hybridized with the PNA, confirmation is expensive and involves multiple steps. 

As for reports that involve the interaction of a double stranded nucleic acid and a DNA- 
binding protein, a method has been described vrfiereby a sequence of immobilized DNA \diich 
contains binding sites for a single protein is used to purify that protein. U*S. Patoat No. 5, 122,600 
to Kam&xM et aL discloses a DNA-immobilized microsphere comprising DNA chains having base 
sequences which specifically bind a particular protein, and a carrier having a particle size of not more 
than 5 0 |im and iK>t loss than 0. 0 1 pm which does not adsorb any protan, said carrier and said DNA 
diains being bound to each oflier by a chemical bond, and a proces s for purifying a protein using s aid 
micro^here. As this is a purification m^hod for a pix>tein, it does not disclose a method of det^ 
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of a TNA nor a method whereby more than one protein is bound to a double stranded nucleic acid 
for the purposes of detection and localization of specific TNA sequences. 

La EP 0 453 301, a method for detecting a polynucleotide target sequence in a sample was 
described \dbercin sequences in a TNA are detected by hybridizing a firs t and a second PN A to the 
5 TNA Each of said first and second PNAsccxitained a pxe-foimed duplex sequence, or a di^lex^ 

is formed through chain extensicm, capable of binding a nucleotide sequence specific binding protein 
A method for binding a nucleotide specific binding protein to a duplex formed between a TNA and 
a PNA only upon formation of a duplex between the FNA and TNA is neither disclosed nor 
suggested. 

0 In US. Patent No. 4,556,643, a method was disclosed for the non-radioac^ve detection of 

specific nudeotuie sequences in a sample which involved hybridization of a probe containing DNA 
binding protein specific sequoices. However, this disclosure neither taught nc»: suggested a method 
for binding a nucleotide specific binding protein to a duplex formed b^een a TNA and a PNA only 
upon formation of a duplex between sequences present in the PNA and sequences present in the 

5 TNA 



Brief Summary of the Invention 
Disclosed are nwdwds by vAnch specific Target Nucleic Acid (TNA) sequences are detected 
througji the use of Probe Nucleic Acids (PNAs) which, upon hybridization with TNAs, are capable 
ofbinding Target Binding AssembUes (TBAs). Each TEA binds at least one specific region of the 
PNA-TNA hybrid pair, the Target Binding Region (TBR). The TEA is comprised of one or more 
molecules, one or more of which can bind to TBR sequences m a specific and sequence or 
conformation dependent manner. The TBA may con^rise one or more piloting sequences, called 
"PILOTS*' or "Asymmetry Sequences,** which assemble and constrain the nucleotide binding 
oooqxments of the TBA to specific geometries. The PILOTS act to assemble specific nucleic acid 
recogmtion units or other pilots to which specific nucleic acid r^gnition units are attached into the 
TBAs in a predetomined fashion. The TBA may also contain one or more molecules which anchor 
or localize the TBA. Novel TBAs having unique discriminating characteristics which surprisingly 
render the TBAs usefiil not only as diagnostic tools but also as prophylactic or therapoitic 
conqxjunds, are also disclosed Efeclosed arc methods and compositions for utilization of the PNAs, 
TBRs, TBAs, and TBA PILOTS, including their utilization as components of diagnostic and 
forensic test kits and the utilization of the novel TBAs as prophylactic or therapeutic agents. 
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ThQ PNAs, in addition to TNA-^mfic sequences^ may also contain one or more sequences, 
1/2 BBRs, capable of hybridizing with complementary 1/2 BBRs in Bo(»ter Nucleic Acids 
(BNAs). Through hybridization of added B^As to the starter 1/2 BBRs present in the PNAs, 
extensions of the PNAs are made in the form of PNA-BNA and then BNA-BNA hybrids. These 

5 extensions can contain one or more Booster Binding Regions (BBRs). Each BBR is capable of 
binding a Booster Binding Assembly (BBA). The BBA is comprised of molecules, one or more of 
\sdiich can bind to a BBR in a specific and sequence or coirfonnation dependent The BBA 

may c<ai?)rise one or more piloting seqi^nces, called "PILOTS" or "Asymmetry Sequoices " ^ch 
assonbleaiKi constrain the iiucleotide binding components of The 

10 PILOTS act to assemble specific nucleic acid recogmti<m units or oOtsx pilots to which specific 
nucldc add recognition uriits are attached into the BBAs in a predetera^^ TheBBAmay 
comtaia molecules vrfrich anchor or localize liie BBA cr ^ch allow for <ktection of the bound BBAs 
and thereby of the TBA-TNA-PNA complexes to which they, in turn, are bound. Disclosed are 
methods and compositicxjs fOTutilizati<^ of the 1/2 BBRs, BNAs, BBRs, BBAs, and BBA PILOTS, 

15 including their utilization as components of diagjiostic and forensic test kits. 

Methods and compositions are disclosed for the use of Hairpin Nucleic Acids (HNAs) as 
capping structures. The HNAs contain a self-hybridizmg r^on and a smgle stranded 1/2 BBR 
vMdi^ under byfaridiziag conditions, can hybridize directly to the 1/2 BBRs in the PNAs or the 1/2 
BBRs in BNAs afreacfy bound to the PNAs, to terminate the extension of BNAs onto the PNA or 

20 onto other BNAs. 

Methods and conqxKiticns are disclosed for test procedures and the production of a test kit 
ccntainii^ PNAs, TBAs, TBRs, BNAs, BBRs, BBAs and HNAs for the detection, localizati<Hi and 
difforentiatioii of specific nucleic acid sequences, including nucleic acid sequences which are found 
in human cells, in the Human Immunodeficiency Virus (HTV), Human Papillomavirus (HPV), and 

25 in other nucleic acid containing systems including viruses and bacteria. 

Accordingly, it is an object of this mvention to provide metiiods and compositions for use 
in bindir^ detecting, and amplifying the detection of specific Target Nucelic Acid sequences in a 
sanq)le with fidelity and accuracy, evoa in the presraice of closely related but different nucleic acid 
sequences. 

30 Accordingly, it is an object of this invention to provide methods and compositions for the creation 
of Target Binding Assemblies which specifically bind Target Binding Regions formed by the 
hybridization of Probe Nucleic Acids and Target Nucleic Acid sequences. 

AiK)ther object of this invention is to provide a method and compositions for the creation 
of Booster Binding Assanblies which specifically bind Boc^ter Binding Regions formed by the 
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hybridization of Booster Nucleic Acid sequences with Probe Nucleic Acids, Booster Nucleic Acids 
and Hairpin Nucleic Acids. 

Anotfao* object of this iBventioEi is to provide a mi^faod and compositions containing Hairpin 
Nucleic Acids \^ch enable the control of the size of spedficalfy or non-spedfically elongated 
5 Booster Nucleic Acids and Bo(^ter Binding Assembli^ used in amplification of PNA-TNA 
hybridization events. 

Another object of this invention is to provide a method and compositions for use in the 
selection, assembly and or chaperoning of specific molecules, eadt Avith nucleic acid binding 
discriminating capabilities, into Target and Booster Binding Assemblies. 
0 Another object of this invention is to provide a method and ccnnpositions for use in 

amplifying the detection of Target Binding assemblies bound to Target Binding Regions using 
Booster Bmding Assemblies and Booster Nucleic Acids. 

Another object of this invention is to provide a method and compositions which allow the 
use of one or more detectable labels, including but limited to radioactive labels, light emitting, 
5 fluoresc^t, ^izymatic or other signal generating molecules. These labels are used in association 

with Probe Nuddc Adds, Taig^ Binding Assemblies, Booster Binding AssembUes, Booster Nucleic 
Acids or Hairpin Nucleic Adds* 

Another object of this inv^tion is to provide a method for isolating nucleic acid fragments 
fcMn an organism which has TBA component binding sites in order to create Probe Nucleic Acids 
0 and TBAs which are unique for that fragment or organism. 

Brief Description of the Drawings 
The foDowingiBusirations are contained in Figure 1: Figure l-I is a PNA containing a 1/2 
TBR, which is a single-stranded sequence which is complementary to a TNA and a 1/2 BBR 

5 sequence. Figure 1-IIa is a TNA to vrfuch is added the components of Figure l-I, and, under 

hybridizing conditions, binds the PNA to form the compcments of figure 1-IIIa, a PNA-TNA 
hybrid containing at least one TBR. Rgure 1-IVa is a BNA which is added to the con^onents of 
Figure l-DIa and, under hybridizing conditions, binds fhe 1/2 BBR of Figure 1-IIIa to form a PNA- 
BNA hybrid containing a BBR shown m Figure 1-Va. 

0 Figure 1 -lib is a BNA which is added the components of Figure l-I, and which, under 

hybridizing conditicMis, binds the PNA to form the components of Figure IJIIb, a PNA-TNA 
hybrid containing a BBR Figure 1-IVb is a TNA to which is added the components of Figure 1- 
mb and which, under hybridiziiig conditions, binds Ae 1/2 TBR of Figure 1-IIIb to form a PNA- 
BNA hybrid containing a TBR shown in Figure 1-Vb. 
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Eigure l-IIc is a HNA vMch is added to the components of Figure l-I and which, under 
hybridizing conditions, binds the PNA to form the components of Figure 1-IIIc, a PNA-HN A 
hybrid containing a BBR. Rgurel-IVcisaTNAwhichisaddedtothecompcxientsof Fig^ 
and which, under hybridizing conditions, binds the 1/2 TBR of Figure 1-IIIc to foraa a PNA-BNA 

5 hj^rid containing a BBR shown in Figure 1-Vc. 

The hybrids which fom the TBRs and BBRs are usrful in t^ ThePNAs 
and BNAs, as indicated in Figure 1, may contain no attached support and/or indicator (OSA), or an 
attached support or other means of localization, inchading, but not limited to, attachment to beads, 
polymers, and surfaces, and/of indicators. 

0 Figure 2a is a diagram of strategies for polymerization of BNAs onto PNAs and capping 

byHNAs, 

Figure 2b is a diagram of additional strategies for amplifying PNA-TNA signals via 
polymerization of BNAs and capping by HNAs, 

Figure 3 is a dij^ram showing the use of BNAs containing multiple 1/2 BBRs per BNA. 
5 Figure4a is a diagram showing the binding of TBAs and BBAs to TBRs and BBRs, and 

the ability of the TBA to discriminate between TNAs and CNAs. According to fliis embodiment, 
if the TBA is immobilized, diher on a bead, microtiter plate surfece, or my other such surface, only 
complexes sudi as complex wcnild bo retained and detected, while complexes such as complex XI 
would not. 

Q Figure 4b is a diagram exemplifying events similar to those shown in Figure 4a but in a 

sUgJitly different order of occurrence. 

Figure 5 is a diagram exemplifying PNAs containing between one 1/2 TBR and no 1/2 

BBR to PNAs containing up to five 1/2 TBRs and one 1/2 BBR The (a) and (b) members of each 

numeral (I, II, HI, IV, V) form a set which, upon hybridization to a TN A, provide TBRs either with 
5 {(a) members) or without ((b) members) an available 1/2 BBR for amplification via hybridization 

to BNAs having complementary 1/2 BBRs. 

Figure 6a i a diagram exemplifying a particular TNA having two 1/2 TBRs which, upon 

bindii^ an apprqpiate PNA, forms two closely associated TBRs capable of binding two TBAs, A 

1/2 BBR is also provided for amplification, 
3 Figure 6b is a diagram dK>wing the same events as in Figu^ 

is used so that discrimination between smgle TBRs that occur in normal cellular samples may be 

discriminated &om abnormal, double TBRs. 

Figure 6cis a diagram showing the same scenario as in Figure 6a except that here, five 

TBRs are identified in the TNA. Each TBR may be bound to a TBA same or different, and each 
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TBA may be differentially labeled, allowing for confirmation that all five sites are present in the 
TNA. 

Figure fid is a diagram of the same events as in Figure 6c except here, a double TBA is 
shown, extending what is shovra in Figure 6b to the use of the ^ Anexan^leof theTNA 

shown in item II in Figures 6a, 6b, 6c and 6d is HIV single stranded DNA or RNA 

Figure 7 shows the HIV LTR as a TNA, and two PN As, and a strategy for detection of the 
TNA using the PNAs. 

Figure 8 is a schematic of one embodiment of the invention wherein a targ^ binding 
assembly is used to bind a hybrid TNA-PNA, and booster binding assemblies are used to bind 
polymerized BNAs. 

Kgure9 is a schematic of amodular TBA in which assembly sequences, linker sequences, 
and asymmetry sequence arc used to chaperone desired niKleic acid lecognitioii units together to 
form a TBA. 

Figure 10 shows modular TBAs useful in detection of HIV-spedfic sequaiccs. 
Figure 1 1 shows modular TBAs usefol in the detection of human papillomavirus sequences. 
Each unit of E2 is actually a dimar of the DNA binding portion of E2. 

Figure 12a is a schematic of TNA fractionation and shift in mobility due to binding of a 

TBA. 

Figure 12b is a schematic of TNA fractionation and enhanced shift in mobility due to 
binding of BB As in addition to TBAs. 

Figure 13 shows a detection strategy for deletion sequences; an example of use of this 
strategy is for a human papillomavirus integration assjy . 

Figure 14 shows assembly of higher order TBAs through use of nucleic acid recognition 
units, linker, assembly, and asymmetiy sequences such tot various Target Binding Assemblies 
specific to binding sites in the HIV LTR are fanned. 

Figure 15 shows assembly of higher order TBAs through use of DNA recognition units, 
linker, assembly, and asymmetry sequences such that various Target Binding Assembhes specific 
to binding sites in the HPV genome are fonned. 

Figure 16 shows the disoimination achieved by using a complex TBA and the ability of 
endogenous competitor target bicdiflg molecule to eliminate binding of the TBA to a cousin nucleic 
acid but not from the TNA \rfiich contains tlie appropriate orientation of more than one site 
recognized by the TBA. 
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Figure 17 shows the ability of a TBAto specifically b& targeted to bind to sites of sequence 
roismatch and to preferentially bind those sites over ccHisin sites which do not contain all of the 



5 Brief Description of the Sequences 

SEQ ID NO. 1 corresponds to Figure 5-Ia-l and shows the class I MHC NF-kB binding 

site. 

SEQ ID NO* 2 cort^onds to Figure 5 (la) and shows the B2-microglobulin NF-kB 
binding site. 

10 SEQ ID NO. 3 corresponds to Figure 5 (la) and shows the kappa immunoglobulin NF-kB 

binding site, 

SEQ ID NO. 4 corraponds to Figure 5 (la) and shows one of the HIV NF-kB binding sites. 
SEQ ID NO. 5 ccmespciKb to Figure 5 (la) and shows one of the HIV NF-kB binding sites . 
SEQ ID NO. 6 corresponds to Figure 5 (la) and shows the c-myc NF-kB binding site. 
1 5 SEQ ID NO. 7 cmesponds to Figure 5 (Ha) and shows a double HIV NF-kB binding site. 

SEQ ID NO. 8 corresponds to Figure 5 (Ha) and ^ows a double HIV NF-kB binding site. 
SEQ ID NOS. 9-16 correspond to Figure 5 (Ha) and show a double binding site with one 
site being an HIV NF-kB buiding site, and the other site beir^ an fflV SPl bmding site. 

SEQ ID NOS. 17-18 correspond to Figure 5 (Ha) and show a double HIV SPl binding site. 
20 SEQ ID NOS. IMl cmespond to Figure 5 (Ilia) and show a double HIV NF-kB binding 

site and an HIV SPl bindmg site. 

SEQ ID NOS. 32-33 correspond to Figure 5 (IVa) and show a quadruple binding site 
wh^e two sites are HIV NF-kB binding sites and two sites are HTV SP i binding sites. 

SEQ ID NO. 34 ccKTCsponds to Figure 5 Via) and stows a quinti9)le b inding site where two 
25 sites are HIV NF-kB binding sites and three sites are ffiV SP 1 binding sites. 

SEQ ID NO. 35 is an example of a 1/2 BBR, in this case the OLl, 0L2 and 0L3 elements 
of the bacteriophage lambda left op^ator, including intervening sequences. 

SEQ ID NO. 36 is an example of a 1/2 BBR, in this case the 0R3, 0R2 and ORI elements 
of the bactOTophage lambda rigjit operator, including intervening sequences. 
30 SEQ ID NO. 37 is the HIV LTR. 

SEQ ID NO. 38 is a PNA complementary to PNA of the HIV LTR. 
SEQ ID NO. 39 is a PNA complementary to a different PNA of the HIV LTR than SEQ 
ID NO. 38. 
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SEQ ID NO. 40 is a PNA complementary to part of the HIV LTR and it also contains a 1/2 
BBR and an oveihang sequence for polymerizing BNAs onto the PNA. 

SEQ ID NO. 41 is a BNA complementary to the SEQ ID NO. 40 1/2 BBR. 

SEQ ID NO, 42 is a BNA that ^vill polymerize onto the SEQ ID NO. 4 1 BNA and which, 
5 with SEQ ID NOS. 40 and 4 1, creates a Pstl recognition site. 

SEQ ID NO. 43 is a BNA that is complementary to the SEQ ID NO. 42 BNA and which 
completes a BamEL recognition site. 

SEQ ID NO. 44 is an HNA vHakh has a BamHl recognition site that will hybridize with the 
BamHl recognition site created by SEQ ID NOS. 42 and 43 to the growing polymer. 
0 SEQ ID NO. 45 is a second PNA \toh, like SEQ ID NO. 40, is complementary to part of 

the HIV LTR, but not to the same sequence as SEQ ID NO. 40. SEQ ID NO. 45 also encodes a 1/2 
BBR and an overhang which will allow polymerization of BNAs starting with a Sphl recognition 
site. 

SEQ ID NOS, 46-62 are human papillomavirus (HPV) specific PNAs which, upon 
5 hybridization with HPV sequences, fonnTBRs which bind HPV DNA binding proteins. 

SEQ ID NOS* 63-71 are NF-kB DNA recognition units for incorporation into TBAs. 

SEQ ID N0» 72 is a nuclear localization sequ^ce. 

SEQ ID NO. 73 is a SPl sequence recognition unit. 

SEQ ID NO. 74 is a TATA binding protein recognition unit. 
rO SEQ ID NOS. 75-84 arc papillomavirus E2 DNA recognition units. 

SEQ ID NOS. 85-92 arc asymmetry sequaices. 

SEQ ID NO. 93 is an arabidc^sis TATA binding protein recognition unit. 

SEQ ID NO. 94 is an HPV-16-E2-1 DNA bindii^ protein recognition unit. 

SEQ ID NO. 95 is an HPV-16-E2-2 DNA binding protein recognition unit. 
15 SEQ ID NO. 96 is an HPV-18.E2 DNA binding protein recognition unit 

SEQ ID NO. 97 is an HP V-33-E2 DNA binding protein recognition unit. 

SEQ ID NO. 98 is a bovine papillomavirus E2 DNA binding protein recognition unit. 

SEQ ID NOS. 99-102 are exemplary linker sequences. 

SEQ ID NO. 103 is an exemplary nuclear localization signal sequence (NLS). 
10 SEQ ID NOS, 104-108 are exemplary diaperone sequences. 

SEQ ID NOS. 109-116 are exemplary assembled TBA sequences. 

SEQ ID NO. 117 is a consensus NF-kB binding site. 

SEQ ID NO. 118 an HIV Tat amino acid sequence. 
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Abbreviations 
single stranded nucleic acid. 

double-stranded nucleic acid 



1 1 M 1 M ! i f 1 1 1 M binding region on nucleic acid 



n no support or indicators, or solid support, or other means of localization, 

10 including, but not limited to, attachment to beads, polymers, and surEaces, 

or indicators = OSA 

BBA boosts binding assembly 

BBR booster binding region 

BNA booster nucleic add 

15 CNA cousin nucleic add 

1/2 BBR single-stranded region which, vAen hybridized to the complementary 

sequence from an UNA or a BNA, can bind a BBA 

1/2 TBR single-stranded region of the PNA which, when hybridized to the 

complementary sequence from a TNA, can bind a TBA 
20 OSA optional si5)port or attachment, circle with box 

PNA probe nuddc acid 

TBA target binding assembly 

TBR target binding region 

TNA target nucleic add 

25 HNA Hairpin Nucleic Add 



Definitions 

It should also be understood from the disclosure which follows that wh^ mention is made 
of such terms as target binding assemblies (TBAs), booster binding assemblies (BBAs), DNA 
30 binding proteins, nucleic acid binding proteins or RNA binding proteins, what is intended are 

compositions comprised of molecules which bind to DNA or RNA target nucleic add sequences 
(TNAs) irrespective of the sp«aficity of the category of binding molecules from which they are 
derived. Thus, for example, a TBA adapted to bind to human immunodeficiency virus sequences 
may be most similar to an NF-KB transcriptional factor which typically binds DNA sequences. 
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Howcv^, as used herein, it will be understood that the TBA may be adapted for optimal use to bind 
to RNA sequences of a particular sequence composition or conformation. 

The fidelity of the detection method disclosed herein depends in large measure on the 
selective binding of TBAs and BBAs to particular nucleic acid motifs. It should be understood 
throughout this disclosure that the basis of TBA and BBA discrimination of TNAs from related 
sequences (cousin nucleic adds or CMAs) may be foimation of precise probe nucleic acid (PNA)- 
target nucleic acid (TNA) lQi)rid segments (PNA-TNA l^brids). However, the basis of 
discrimination may just as well be die formation of a particular conformation, and may not require 
the con^iete absence of mismatched-base pairing in the TNA-PNA hybrid. Accordingly, the basis 
of TBA or BBA operation should be understood throughout to depend on discrimination of my 
property unique to die TNA-PN A hybrid as opposed to any properties displayed by any PNA-OSf A 
hybrids that may be formed in a test sanq)le contacted witii a given PNA. 

Detailed Disclosure of the Inventicm 
The present invention provides a method for specifically identifying a target nucleic acid 
(TNA) in a smph thrcsigh the use of target binding assemblies (TBAs) wiuch incorporate specific 
nucleic acid binding proteins. By using probe nucleic acids (PNAs) specific to a given TNA 
sequence, and a TBA which is specific to the di^lex target binding region (TBR) formed upon 
formation of l^brid TNA-PNA sequences, a stable TBA-TNA-PNA complex is fMrned. By 
additionally providing specific ampHfiable sequences in the PNA, in addition to sequences which 
specifically contribute to the formation of the TBR recogni^ by the TBA, the binding of the PNA 
to the TNA is detected and the detection amplified Fcff this purpose, any of a number of nucleic acid 
amplification systems, including polymerase chain reaction, or die use of branched DNA, each 
branch of viiich contains a d^ectable label, may be used. In particular, a novel m^hod of 
amplification is described herein where the amplifiable portion of the PNA contains sequences onto 
which booster nucleic acids (BNAs) may be polymerized. Upon fonnation of each BNA-PNA 
l^d, a boosts binding region (BBR) is formed to which a booster binding assembly (BBA) binds 
specifically. If detectably labeled, the BBAs or BNAs provide essentially unlimited ampUfication 
of the original TNA-PNA binding event. 

According to this invention, the TNA will be understood to include specific nucleic acid 
sequences. The TBA will be understood to be any molecular assembly which can specifically and 
tightiy bind to a formed TNA*PNA hybrid. The TBA will contain one or more molecules whose 
seqxxences are sufficient to bind to the TBR. Nucleic acid bindmg dcsnains which are known can 
either be used direcfly as components of the TBA or modified according to the teachings provided 
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hereia The most readily availabk molecules with such sequences are the DNA-binding domains of 
DN A-binding proteins. Specifically, many DNA or RNA binding proteins are known which can 
eite be used directly as the known, unmodified protein, or the TB A may be a nucleic acid binding 
protein, modified according to the specific teachings provided herein. In the latter case, specific 
5 modifications that are desirable would include optimization of binding afSnities, removal of 
unwanted activities (such as niKiease activity and reorganization of the TBA in the presence of other 
molecules with an affinity for components of the TBA), optunizaticHi of selectivity of a target 
sequence over closely related sequences, and optimization of stability. 

Examples of DNA binding protons which could be used according to this invention are the 
10 DNA-binding porticHis of the transcription factor NF-kB (p50 and p65), NF-IL6, NF-AT, rel, TBP, 

flie papilloma virus' E2 protean, spl, the repressors cro and CI from bacteriophage lambda, and like 
proteins are well known proteins whose DNA binding portion has been isolated, cloned, sequenced, 
and characterized. In addition, any other DNA-bmding protein or portion of a protein that is 
necessary and sufficient to bind to a TBRhybrid or a BBR is included This includes proteins or 
15 portions of wild-type proteins with altered DNA binding activity as well as protein created with 
ahOTd DNA-binding specificity, sudi as the exchange of a DNA-binding recognition helix firom one 
protein to another. In addition, proteins which exhibit nucleic add binding and other nucleic acid 
functions, such as restriction endonucleases, could be used as the nucleic acid binding function. 
Proteins which bind to target regions in DNA-RNA hybrids as well as RNA-RNA hybrids are 
20 inchided. (See, for example, Shi 1995, DeStefano 1993, Zhu 1995, Gonzales 1994, Salazar 1993, 
Jaishree 1993, Wang 1992, Roberts 1992, Kainz 1992, Salazar 1993(b)). The binding assembhes 
may be constructed with the use of a molecule which chaperones portions of the binding assembly 
so that specific component combinations and geometries can be achieved. This molecule is 
designated terc as a PILOT. Pilots can be comprised of proteins or any combination of CMrganic and 
25 inorganic materials vAich achieve the combinatorial selection and/or to induce specific geometries 
between members of the TBA or BBAs, A chaparone is a stable scaffold upon vMch a TBA or 
BBA may be constructed such that the correct conformation of the TBA or BBA is provided while 
at the same time eliminating undesirable properti^ of a naturally occurring nucleic acid binding 
protein. As a specific example of this embodiment, a modified version of the pleiotropic 
30 tramcrqjtion factor, NF-kB, is provided using a modified bacteriophage lambda cro protein as the 

ch^)eraac. Each NF-kB binding dimer retains the picomolar binding affinity for the NF-kB binding 
site while at the same time the binding assembly presents several advantageous manufacturing, 
stability, and specificity characteristics. 
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In view of the foregoing, the various aspects and embodiments of this invention are 
described below in detail. 

L The Probe Nucleic Acids fPNAs^ and their preparation . The PNAs of tiie present 
inventiai conq}rise at least three principal parts joined together. With reference to Figure 1(1) of the 
drawings, the first part of the PNA is one or more sequences of bases, designated "1/2 TBR." With 
reference to Figure 1(1 and Ha) of the ckawings, the 1/2 TBR in the PNA is con^iementaiy to a 
sequence of interest in a sanq>le, the TNA containing a 1/2 TBR. With reference to Figure l(IIIa) 
of the drawings, the TNA, when added to the PNA under hybridizmg conditions, forms a PNA-TNA 
hybrid containing a TBR. With reference to Figure 1(1) ofthe drawings, the second part of the PNA 
is a seqiraice of bases, designated "1/2 BBR" With reference to Figure 1(1, lib, He, and IVa) ofthe 
drawings, the l/2BBRin the PNA is complementary to a 1/2 BBRcontamed in a BNA or aHNA. 
Witfareferaice to Figure IQSb, IIIc, and Va) ofthe drawings, the BNA or HNA, when added to the 
PNA under tybridizing conditions, forms a PNA-BNA hybrid or PNA-HNA hybrid, respectively, 
containing a BBR With ref^ence to Figure 10) ofthe drawings, the third part ofthe PNA is the 
OS A, desigoated by a circle with a box around it The OS A is no support and/or an indicator, or 
solid support, or other means of localization, including but not limited to, attachment to be^, 
pofymers, and surfeces and/or indicators which is/are covalaitly attadied to, cm: non-covalaitly, but 
specifically, associated with the PNA. The OSA m^ be an atom or molecule which aids in ^ 
separation and/or localization such as a solid support binding group or label which can be detected 
by various plysical means including, but not limited to, adsorption or imaging of emitted particles 
or %>it Methods for attaching indicators to oligonucleotides or for immobilizing oligonucleotides 
to solid supports are well known in the art (see Kell^ and Manak, supra, herein incorporated by 
reference). 

The PNA ofthe present invaition can be prepared by any suitable method Such methods, 
in general, wiU include oligonucleotide synthesis and cloning in a replicable vector. Methods for 
nucleic arid synthesis are well-known in the art. Whai cloned or synthesized, strand purification and 
separation may be necessary to use the product as a pure PNA. Methods of preparing RNA probes 
are well known (see for example Blais 1993, Blais 1994, which uses m vitro transcription from a 
PGR reaction incorporating a T7 RNA polymerase promoter). 

The length and specific sequence ofthe PNA will be understood by those skilled in the art 
to depend on the loigth and sequence to be detected in a TNA , and the strictures for achieving tight 
and specific bindii^ ofthe particular TBA to be used (see discussion on TBAs below). In general, 
PNAs of sequence lengths between about 10 and about 300 nucleotides in length are adequate, with 
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lengths of about 15-100 nucleotides being desirable for many of the embodiments specifically 
exemplified hereia 

It should also be understood that the PNA may be constructed so as to contain m^e than 
one 1/2 TBR and to produce more than one TBR for one or more TBAs, same or different, as weD 
5 as complex TBRs recogni^ fay novel di?)Iex and multiplex TBAs (see descriptiou below regarding 
fiiese novel TBAs) vpon hybridization of the PNAs and TNAs. Figure 5 illustrates spedfic PNAs 
which contain one or more 1/2 TBRs. Specific sequences which correspond to the 1/2 TBR 
sequences illustrated in Fi^ 5 (la, Ila, ma, IVa, and Va) are SEQ ID NOS. 1-34 (see Desraption 
of Sequences above). 

10 As Aown in Figar«s 2a and 2b, the PNA, containing a 1/2 TBR, may be hybridized with 

one or more BNAs (see description below) and the chain of BNAs polym©rized to any desired 
p(^ential length for anq)lification of tte TNA-PNA hybridization event Preferably, bet\^'een about 
0 and about 10 1/2 BBRs wiU be present in the PNA. 

As shown in Figures 6a and 6b, the PNA may contain several 1/2 TBRs, same or different, 

15 which can hybridize with several 1/2 TBRs in a TNA. Each time a 1/2 TBR in the PNA matehes 

a 1/2 TBR in a TNA, a Target Binding Region, TBR, is formed which can bind a TBA. 
Furthermore, it is not essential that all of the TBRs be on a single, contiguous PNA. Thus, in one 
embodinj^t of the invention, two diffaent PNAs are used to detect sequences on a particular TNA. 
As an illustration of this aspect of the invention, Figure 7 shows one representation of the human 

20 immunodeficiency virus (HIV) long terminal repeat (LTR). As is known in the art, the HTV LTR 

congsriscs twoNF-kB binding sites and three SPl binding sites, in close proximity, wherein NF-kB 
andSPl are known DNA binding proteins. Figure 7 provides two PNAs, PNA 1 (SEQIDN0.38) 
and PNA2 (SEQ ID NO, 39), each of which is complementary to the opposUe strand shown as a 
TNA (SEQ ID NO. 37), which shows the two NF-kB binding sites and the three SPl binding sites 

25 of the HIV LTR According to this aspect of the invention, PNAl specifically hybridizes with that 
section of the TNA shown in Figure 7 with bases underscored with a symbol, while PNA2 
specifically l^bridizBS with that section of the TNA shown in Figure 7 with bases underscored with 
an ' ^boL Each of PNAl or PNA2 may also contain sequences (indicated by the symbols "^iT 
or "*") which will hybridize with a BNAs 1/2 BBR sequences {see below). In addition, each of 

30 * PNAl and PNA2 may be differentially tagged with an OSA, such as a fluorophorc such as a 
fluorescein or a rhodamine label, which would allow confirmation that both probes have become 
bound to the TNA. If only one label or neither label is detected, it is concluded that the TNA is not 
present in the sample being tested. 
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In a further aspect of the embodiment shovm in Figure 7, a method for altering the 
specificity of the instant ass^' method is shown. By changing the length of the gap between PNAl 
and PNA2, such that the region of TNA remaimng unhybridized is alta*ed, one practicing this 
invention is able to alter the discrimination of the assay. 
5 In order to more dearly exemplify this aspect of the invention, it is necessary to emphasize 

that the TBR may have a helical structure. Thus, while PNAl creates TBRs on one ^'fece" of the 
helix, PNA2 creates a TBR on either the same or a different face of the helix, depending on the 
distam^betweeai the middle ofcach TBR (underlined in Fi^ 7). Ifthe middle ofeach binding site 
is an integral product of 10,5 bases q)art, the TBRs will be on the same side of the helix, while non- 
10 integer products of 10.5 bases apart would place the TBRs on opposite sides of the helix. In this 
fashion, any cooperativity in binding by the TBA recognizing the PNAl TBR and the TBA 
recognizing the PNA2 TBR can be manipulated (see Hochschild, A., M. Ptashne [1986] Cell 
44:681-687, showing tids effect for the binding of bacteriophage lambda repressor to two different 
operator sites located at different distances from each other in a DNA helix). As described by 
15 Pokins et aL ([1993] £MBO J. 12:3551-3558), cooperativity between NF-kB and the SPl sites is 
required to achieve activation of the HIV LTK However, for the purpose of the instant invention, 
the double NF-kB-triple SP 1 binding site motif in the HIV LTR may be taken advantage of by 
providing a single, novel binding protein capable of binding both sites simultaneously, but only if 
the spacing between the sites is geometrically feasible. This is controlled both by the structure of 
20 the selected TBA and by the PN As used. Thus, in the embodiment exemplified in Figure 7, the two 
probes m^ be used with a large enough interprobe region of single-stranded DNA remaining such 
that, even if the NF-kB and SPl binding sites are on opposite sides of the helix, the single-stranded 
region between the probes provides a sufiBciently flexible "hii^e*' so that the DNA can both bend 
and twist to accommodate the geometry of the TBA. Alternatively, a more stringent ass^ may be 
25 designed by narrowing the interprobe distance such that the DNA may only bend, but not twist 
Finalfy, the probes may be so closely spaced, or a smgle PN A used, such that the DNA can only bend 
but not twist Thus, Ihis figure exemplifies and enables the production of detection systems with any 
given desired degree of discrimination between target nucleic adds having similar sequences, but 
different juxtapositions of these sequences. 
30 In terras of a diagnostic or forensic kit for HIV, those skilled in the art would understand that 

the aforementioned aspects of this invention allow for the tailoring of the components of the 
diagnostic or forensic kit to match what is known at any given time about the prevalent strains 0 
HIV or another pathogen or disease condition. It will also be appreciated by those skilled in the art 
that, while detection of HIV infection is not the only utility of the instant invention, due to the 
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mutafaili^^ of the HIV genome, it is probably one of the most complex test environments for such a 
diagnostic. It is precisely in such a mutable environment, however, where the flexibiiity of the 
instant m^od, ixxspM with its ability to discriminate betwen very closely related sequences, may 
be most clearly appreciated In less mutable environments, some of the sophistication to which this 

5 invention is amenable need not be utilized. Thus, in a diagnostic kit for p^iUomavirus infection, 
all of the discrimination characteristics of the TBA-TBR interaction are available, along with the 
ability to amplify the signal using the BNAs and BBAs, but a single, smpk PNA, such as any one 
of SEQ ID NOS. 46-62, may be used vMdx identifies imique papillomavirus sequences, vldch also 
are known to bind to a TBA such as the papillomavirus E2 protein or truncated DNA binding 

10 portions thereof (see Hegde et al [1992] Nature 359:505-512; Monmi et al [1991] J. Virol 

65:2124-2130), 

In ^pfyiiig the instant method to the detection of a particular TNA for the purposes of 
assessing whether certain nucleic acids are present which are associated with the progression of 
melanoma, hepatoma, breast, cervical, Umg, colon, prostate, pancreatic or ovarian cancers, fee TNA 

15 may be obtained from biopsy materials taken from ra^gans and fluids suspected of containing the 

cancerous cells. For the detection of genetic deficiencies, the TNA may be (Atained from patient 
san?5les containing the affected cells. For detection of fermentation contaminants and products in 
the manufacture of food, chemical or biotechnology products or in the bioremediation of wastes, the 
TNA may be obtained from sanq>les taken at various stages in the fennentation or treatment process. 

20 For detection of food or drag pathogens or ccsitaminants, the TNA sample m^ be obtained from the 
food or drug, swabs of food or surfaces in contact with the food, fluids in contact with the food, 
processing materials, fluids and the like associated with the manufacture of or in contact with the 
food, drug, or biological samples taken from those in contact with the food or drug or the hke. 

2, The Booster Nucleic Acids fBNAsV Booster Bindmg Regions (BBRs> and their 

25 preparation . The BNAs of the present invention are comprised of at least one or more 1/2 BBRs 

coupled to an OSA. The 1/2 BBRs can hybridize to compianentaiy 1/2 BBRs contained in the 
PNA, odier BNAs or an HNA. 

With refcraice to Figure Iff, lib andlUb) of the drawings, the sin^lest BNA is comprised 
of two parts. With reference to Figure l(IIb) of the drawings, the first part of the sin5)lest BNA is 

30 a sequence of bases which is complonentary to the sequence in the PNA which is designated " 1/2 

BBR" With refeicnce to Figure l(IIb) of the drawings, the second part of the simplest BNA is the 
OSA, desi^iatcd by a drde with a box around it The OSA is no support and/or indicator, or solid 
support, or c&er means of localization, including but not limited to, attachment to beads, polymers. 
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and surfaces and/or indicators which are covalently attached to, or non-covalently, but specifically, 
associated with the BNA. 

With reference to Figure 2a(II ard IE) of the drawings, the BNA may contain more than one 
1/2 BBR sequence. The BNA illustrated in Figure 3(11) contains a sequence ^ch is 

5 con??lementaiy to the PNA iflustrated in Figure 3(1) and two other 1/2 BBR sequences. The BNA 
illustrated in Figure 3(111) contains two 1/2 BBR sequences \^ch are complemaitaiy to two of the 
1/2 BBR sequences in the BNA illustrated in Figure 3(11), plus up to "n" additional 1/2 BBRs for 
polymerization of additional BNAs, 

Under tybridizing conditions, the BNA illustrated in Figure 3(11), when combined with the 

10 PNA iflustrated in Figure 3(1), creates the PNA-BNA hybrid illustrated in Figure 3(IVa) containing 

a BBR and an unhybridized extension with two additional 1/2 BBR sequences or "booster'' 
sequences. The BBRs created by said hybridization can be identical, similar or dissimilar in 
sequence. The BBRs created by said hybridization can bind identical, sinailar or dissi^^ 
(see below). The BNAsm^ have prepared analogous^ to the PNAs. 

15 Under hybridizing conditions, the BNA-BNA hybrid illustrated in Figure 3(IVbX when 

combined with the PNA iUustrated in Figure 3(Vb), creates the PNA-BNA hybrid iQustrated in 
Figure 3(VI) containing a BBR, two additicHial BNA-BNA l^rids containing BBRs, and an 
unhybridized extension with an additional 1/2 BBR sequence, a "booster" sequence. The BBRs 
created by said hybridization can be identical, similar or dissimilar in sequence. The BBRs created 

20 by said hybridization can bind identical, similar or dissimilar BBAs (see below). The BNAs may 

be prepared in a fashion analogous to preparation of the PNAs. 

3. The Targ^ Nucleic Adds fTNAs) and their preparation . The first step in detecting and 
msphfying signals produced throu^ detection of a particular TNA according to the present method 
is the hybridization of such target with the PNA in a suitable mixture. Such l^bridization is 

25 achieved under suitable conditions well known in the art. 

The sample suspected or known to contain the mtended TNA may be obtained firom a 
variety of sources. It can be a biological sample, a food or agricultural sample, an environmental 
sample and so forth. In applying the mstant method to the detection of a particular TNA for the 
purposes of medical diagnostics or forensics, the TNA may be obtained fiiom a biopsy sample, a 

30 body fluid or exudate such as urine, blood, milk, cerebrospinal fluid, sputum, saliva, stool, lung 

aspirates, throat or genital swabs and the like. In addition, detection may be in situ (see for example 
Embretsm 1993; Patterson 1993; Adams 1994). 
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Accordingly, PNAs specific to vertebrates (including mammals and including humans) or 

to any or all of the following miowrganisms of interest may be envisioned and used according to 

the instant method: 

Corvnebactma 

Corynehacterium diphtheria 

Bacillus 

Bacilhts thutingiensis 
Pneumococci 

Diplococcus pneumoniae 
Streptococci 

Streptococcus pyogenes 

Streptococcus saltvarius 
Staphylococcus 

Staphylococcus aureus 

Staphylococcus albus 
Pseudomonas 

Pseudomonas stutzen 
Neisseria 

Neisseria meningitidis 

Neisseria gonorrhea 
faiterobacteriaceae 

Escherichia colt 

Aerobacteria aerogenes 

Klebsiella pneumoniae 

Salmonella typhosa 

Salmonella choleraesuis 

Salmonella typhimurium 

Shigellae dysenteriae 

Shigellae schmitzii 

Shigellae arabinotarda 

Shigellae fiexneri 

Shigellae boydii 

Shigellae sonnei 
Other enteric bacilli 

Proteus vulgaris 

Proteus mirabilis 

Proteus morgani 

Pseudomonas aeruginosa 

Alcaligenes faecalis 

Vibrio cholerae 
Hcmophilus-Bordetella group 

Hemophilus influenza, H. ducryi 

Hemophilus hernophilus 

Hemophilus aegypticus 

Hemophilus parainfluenzae 

Bordetella pertussis 
Pastcurellae 

Pasieurella pestis 

Pasteurella tulareusis 



The colifoim bacteria 
The Salmonellae 

The Shigellae 

Proteus species 



50 
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Bruccllae 

Brucella melitensis 
Brucella abortus 
Brucella suis 
5 Aerobic Spore-Forming Bacilli 

Bacillus anthracis 
Bacillus subtilis 
Bacillus megaterium 
Bacillus cereus 
10 Anaembic Spnrft^Fnrm mg Bacilli 

Clostridium botulinum 
Clostridium tetani 
Clostridium perfringens 
Clostridium novyi 
15 Clostridium septicum 

Clostridium histolyticum 
Clostridium tertium 
Clostridium bifermentans 
Clostridium sporogenes 
20 Mycobacteria 

Mycobacterium tuberculosis hominis 
Mycobacterium bovis 
Mycobacterium avium 
Mycobacterium leprae 
25 Mycobacterium paratuberculosis 

Actinomvcetcs (fungus-like bacteria) 
Actinomyces isaeli 
Actinomyces bovis 
Actinomyces naeslundii 
30 Nocardia asteroides 

Nocardia brasiliensis 
The Spirochetes 

Treponema pallidum 
Treponema pertenue 
35 Treponema carateum 

Spirillum minus 
Streptobacillus moniliformis 
Borrelia recurrens 
Leptospira icterohemorrhagiae 
40 Leptospira canicola 

Trypanasomes 
Mycoplasmas 

Mycoplasma pneumoniae 
Other pathogens 
45 Listeria monocytogenes 

Erysipelothrix rhusiopathiae 
Streptobacillus moniliformis 
Donvania granulomatis 
Bartonella bacillformis 

50 
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Rickettsiae (bacteria^like parasites) 

Rickettsia prowazeJdi 

Rickettsia mooseri 

Rickettsia rickettsfi 

Rickettsia conori 

Rickettsia australis 

Riclxttsia sibiricm 

Rickettsia akah 

Rickettsia tsutsugamtishi 

Rickettsia bumetti 

Rickettsia quintana 
Chlamydia (uaclassifiable parasites bacterial/viral) 

Chlamydia agents (naming uncertain) 

Fungi 

Cryptococcus neoformans 
Blastomyces dematidis 
Histoplasma capsulatum 
Coccidioides immitis 
Paracoccidioides brasiliensis 
Candida albicans 
Aspergillus fumigatus 
Mucor corymbifera {Absidia corymbifera) 
Rhizopus oryzae 

Rhizopus arrhizus Phycomycetes 

Rhizopus nigricans 

Sporotrichum schenkii 

Flonsecaea pedrosoi 

Fonsecaea compact 

Fonsecacae dermatidis 

Cladosporinm carrioni 

Phialophora verrucosa 

Aspergillus nidulans 

Madurella mycetomi 

Madurella grisea 

Allescheria boydii 

Phialophora jeanselmei 

Microsporum gypsum 

Trichophyton mentagrophytes 

Keratinomyces ajelloi 

Microsporum canis 

Trichophyton rubrum 

Microsporum adouini 

Viruses 

Adenoviruses 
Herpes Viruses 

Herpes simplex 
Varicella (Chicken pox) 
Herpes zoster (Shingles) 
VirusB 

Cytomegalovirus 
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Poxviruses 

Variola (smallpox) 

Vaccinia 

Poxvirus bovis 
5 Paravaccinia 

Moiliiscum contagiosum 
Picomavimses 

Poliovirus 

Coxsackievirus 
10 Echoviruses 

Riunoviruses 
Mvxoviruses 

Influenza (A, B, and C) 

Parainfluenza (1-4) 
15 Mumps virus 

Newcastle disease virus 

Measles virus 

Rinderpest virus 

Canine distemper virus 
20 Respiratory syncytiai virus 

Rubella virus 
Arboviruses 

Eastern equine encephalitis virus 

Western equine encephalitis virus 
25 Sindbis virus 

Chikugunya virus 

Semliki forest virus 

Mayora virus 

St. Louis em^halitis virus 
30 California encephalitis virus 

Colorado tick fever virus 

Yellow fever virus 

Dengue virus 
Reoviruses 
35 Reovirus types 1-3 

Retroviruses 

Human immunodeficiency viruses (HIV) 

Human T-ccll lymphotrophic virus I & n (HTLV) 
Hepatitis 

40 Hepatitis A virus 

Hepatitis B virus 

Hepatitis nonA-noaoB virus 

Hepatitis, C, D,E 
Tumor viruses 
45 Rauscher leukemia virus 

(jross virus 

Maloney leukemia virus 
Human papilloma viruses 
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It ^vould be understood by caie of skill in the art that it is generally required to treat samples 
suspected of containing a particular TNA in such a fashion as to produce fragments that can easily 
hybridize widi the PNA. It may be necessary to treat the test sample to efifect release of or to extract 
the TNA for irjtelization, such as by exposing blood or other cells to a hypotonic environment, or 
5 otheiwise disrupting the san5)le using more vig(HOusnieans^ Wten the TNA is thought to be present 
in double stranded form, it would naturally be desirable to separate the strands to render the TNA 
l^todizabk in single stranded fonn by methods weU know^ 

heating or limited exposure to alkaUne conditions which may be neutralized upon addition of the 
single stranded PNA to allow hybridization to occur. Methods for preparing RNA targets are well 

10 known (see Waterhouse 1993, Mitdiell 1992). 

Fragmentation of nucleic acid samples containing TNAs is usually required to decrease the 
sanqsle viscosity and to increase the accessibility of the TNAs to the PNAs. Such fragmentation is 
^sccxnplished by random or specific means known in the art Thus, fOT example, specific nucleases 
known to cut with a particular frequency in the particular genome being analyzed, may be used to 

15 produce fragments of a known average molecdar size. In addition, other nucleases, 
plK)sphodiesterases, exonudeas^ and endonuclcases, physical shear and sonication are all methods 
amenable for this purpose. These processes are well known in the art. The use of restriction 
enzymes forthepurposeofDNA fragmentation is generally prefCTred However, DN A can also be 
fragmented by a variety of chemical means such as the use of the following types of reagents: 

20 EDTA-Fe(II) (according to Stroebel et al [1988] 1 Am. Chem Soc. 110:7927; Dervan [1986] 
Science 232:464); Cu(n3-phenanthroline (according to Ossxl and Sigman [1987] Science 237: 1 197); 
class nS restriction enzyme (according to Kim et al [1988] Science 240:504); hybrid DNAse 
(according to Corey et al [1989] Biochem. 28:8277); bleomycin (according to Umezawa et al 
[1986] J. Antibiot, (Tokyo) Ser, A, 19:200); neocarzinostatin (Goldberg et al [1981] Second 

25 Annual Bristol-Myers Symposium in Cancer Research, Acsidissmc Press, New York, p. 163); and 

mdhidmn]piopyl-EDTA-Fe(II) (according to Hertzberg et al [1982] 1 Am, Chem. Soc 104:3 13). 
Removal of proteins, as by treatment with a protease, is also generally desirable and methods for 
effecting protein removal from nucleic acid samples, without appreciable loss of nucleic acid, are 
well known in the art, 

30 The TNAs of the present invention should be long enough so that there is a sufficient 

amount of double-stranded hybrid flanking the TBR so that a TB A can bind unpemirbed by the 
unligated fragment ends. Typically, fragments in the range of about 1 0 nucleotides to about 1 00,000 
nucleotides, and preferabfy in the range of about 20 nucleotides to about 1,000 nucleotides are used 
as the average size for TNA fragments. Examples of specific TNA sequences that could be detected 
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are sequences complemeiitaiy to the PN A sequences described herein for detection of normal 
cellular, abnonnal cellular (as in activated oncogenes, integrated foreign genes, genetically defective 
genes), and pathogen-spedfic niKsIeic acid sequences, for which specific nucleic acid binding proteins 
are known, or which can be produced according to methods described in this disclosure. With 
5 reference to Figure 7, a specific HlV-related TNA is shown as SEQ ED NO. 37. 

4. Extensioais to the PNA using BNAs. their preparation, and signal amplification . Under 
lybridizing oonditicHis, BNAs can be added that hybridize to the PNAs, PNA-BNA hybrids, BNAs 
and/or BNA-BNA l^rids. The afOTsnendoned additions can be made in a non-vectorial polymaic 
Q fashion or in a vectorial fashion, with a known order of BNAs. 

7^ 10 With reference to Figure 2a, a simple booster is presented A boosts polymer is produced 

f J by adding two BNAs, illustrated in Figure 2a(Ib and Ic), which when combined under hybridizing 

|l conditions with the PNA, fonn PNA-BNA-BNA hybrids, comprised of the PNA and "booster" 

extensions", illustrated in Figure 2a(IIa,IIbJIc and lid) leaving at least one unpaired 1/2 BBR 
J, sequence. Each ui^)aiied 1/2 BBR sequence, illustrated in Figure 2a(na, lib, He, Hd) can l^^bridize 

'T; 15 with additional BNAs to form additional "boc^er'' extensions. Each unpaired 1/2 BBR sequence, 
r[| illustrated in Figure 2a(IIaJIb4Ic and Ed) can hybridize with added HNAs, illustrated in Figure 

;;J 2a(IIIa and Mb). hybridization of the HNAs, which cannot hybridize additional BNAs, acts to 

'^1 ^'cap" the addition of the BNAs onto the PNA, as illustrated in Figure 2a(IV a, IVb, IVc and IVd). 

With refeence to Figure 2b, it is possible to contrcA and specify the order and CQn^)onents 
20 of extensions to the PNA. If a single BBR is required, a HNA containing the complementary 

sequence to the 1^ BBR in the PNA is added to the PNA to produce a single BBR and to "cap" any 
"booster" extensions to the PNA. If additional BBRs are to be added to the PNA, a controlled 
extension of the PNA can be accomplished. 

With refom^ to Figure 2b, a simple booster is presented. Vectorial polymer extension is 
25 acccmqiUshed by adding a BNA which is spedfic for the PNA, as illustra^ m Figure 2b(Ia and Ha), 
\^ch whra combined under hybridizing conditions with the PNA, form PNA-BN^^ 
ccmprised of the PNA and "boosta:^' extensions. These extensions, if labeled with an OSA, provide 
a metlK)d for great^ amphfymg any signal produced vpcm binding of a PNA to a TNA in the sample. 
Furthermore, by binding labeled BBAs to the BBRs in the polymer, additional amplification is 
30 achieved. 

Any of a number of methods may be used to prepare the BNAs, including, e.g., synth^is 
via known diemistiy or via recombinant DNA production methods. In the latter method, an 
essentially unlimited number of BNAs may be produced simply and inexpaisively, for example, by 
production in prokaryotes {£. coli for example) of a plasmid DNA having multiple repeats of the 
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specific BNA sequences flanked by restriction sites having overhanging ends. In this fashion, for 
example, the bacteriophage lambda left or right operator sites, or any other DNA or other nucleic 
acid sequence known to specifically and tightly binid a particular BBA, such as a DNA or RNA 
bmding protein, may be produced in an essentially unlimited number of copies, with each copy 
5 flanked by an EcdSl, Pstl BamHl or any of a number of other common restriction nuclease sites. 
Alternatively, a polymer at repeated sites m^ be excised by unique restriction sites not present 
within the polymer. Large quantities of pBR322, pUC plasmid or other plasmid having multiple 
copies of these sequences are produced fay methods well known in the art, flie pi asmid cut with the 
O restrktkm enzyme flanking the polymerized site, and the a 

lo 10 isolated dther by dm>matogr^ or aiy other convenient means kno The BNA, prior 

0 ' to use is then strand separated and is then amenable for polymerization onto a PNA encoding a 

single stranded con^)lenKntary copy of the operator as a 1/2 BBR. The BNAs may be polymerized 
% vectorially wito the PNA by using different restriction enzymes to flank each repeat of the polymer 

/ inflieplasmidusedtoproducemuitiplecopiesoftheBNA. Alternatively, the BNA polymer may 

J 15 be hybridized to the PNA via overhangs at one or both ends of die BNA polymer, without the need 

fIJ to strand separate and anneal each BNA segment Examples of specific BNA sequences are 

^1; provided above in the section entitled Description of Sequences, as SEQ ID NOS. 35-36. To 

'-=1 stabilize the BNA polymer, DNA Ugase may be used to covalently link the hybridized BNAs. 

5. The Hairpin Nucleic Acids fHNAs) and their preparation The HN As of the present 
20 invention comprise at least two principal parts joined together: A single-stranded sequence, \rfuch 

is complementary to a 1/2 BBR, and a double-stranded nucleic acid region formed, under Iwbridizing 
conditi(»is, by the self-association of self-complementary sequences within the HNA. With reference 
to Figure l(IIc) of the drawings, die 1/2 BBR in the HNA m^ be constructed so as to be 
complementary to the 1/2 BBR sequence in the PNA. With reference to Figure 1(1, He and nic) of 
25 the drawings, the aforementioned HNA, v*en added to the PNA under hybridizing conditions, forms 
a PNA-HNA hybrid containing a BBR. With reference to Figure l(nic, IVc and Vc) of the 
drawings, a PNA-HNA hybrid, under hybridizing conditions, i^n addition of the TN A, can form 
a TN A-PNA-HNA hybrid containing a TBR and a BBR. 

With reference to Figure 2a and 2b, the HNAs can be used to "cap" or terminate the addition 
30 of BNA extensions to the PNA. The two BNAs in Figure 2a(rb and Ic) can associate to form the 

hybnd shown in Figure 3(IVb) or can hybridize directly and individually to the PNA as illustrate 
in Figure 2a(Ia-c, Ila-d). The two HNAs (shown in Figure 2a(IIIa and Illb)) can terminate the 
hybridization of the BNA to other BNAs which extend from the PNA, as illustrated in Figure 2a 
(IVa-dX The HNA in Figure 2a(IIIa) can terminate the PNA-BNA hybrids shown m Figure 2a(IIb 
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aad nd) and am^ PNA-BNA hybrid with a single stranded 1/2 BBR which is complementary to the 
1/2 BBR in the HNA iUustrated in Figure 2a(ffla). Similarly, the HNA in Figure 2a{nib) can 
terminate fee PNA-BNA hytmds diown in Figure 2a(IIa and lie) and aity PNA-BNA hybrid with two 
single stranded 1/2 BBRs which are complementary to the 1/2 BBRs in the HNA iUustrated in 
Figure 2a{IIIb). 

HNAs are constructed that will terminate PNA-BNA hybrids which are constructed j&om 
the sequential addition of BNAs to the PNA as illustrated in Figure (2b). The single stranded 1/2 
BBR sequences illustrated in Figure 2b(Ia. Ilia, Va, and Vila) are specifically complementary to the 
single stranded 1/2 BBR sequences illustrated in Figure 2KIi> JIIb,Vb and Vllb) and produce the 
unique capped PNA-BNA-HNA hybrids illustrated in Figure 2b(IcJIIc,Vc and VIIc). 

The self-ccxi?>lementary sequoaces in the HNA and the loop sequence which links the self- 
complementary hairpin sequences can be of any oonqwsition and length, as long as they do not 
substantially impede or inhibit the presentation of the single-stranded 1/2 BBR that comprises part 
of the HNA by the HNA or selectively bind the BBA or the TBA. The loop sequences should be 
selected so feat foraiaticKi of the loop does not impede formation of the hairpin. An 6»£ampies of an 
HNA useful in this q>plication is provided as SEQ ID NO. 44 (see Description of Sequences above), 

6. The Target Binding Assemblies (TBAs) and their preparatioa A TBA may be any 
substance which binds a particular TBR formed by hybridization of particular TNAs and PNAs, 
provided that the TBA must have at least the following attributes: 

(a) The TBA must bind the TBR(s) in a fashion that is highly specific to the TBR{s) 
ofinterest That is, the TBA n^istdisominate between TBRspi^ent in t^ 
PNA hybrid and similar duplex sequences formed by PNA-Q^A hybrids. The 
TBA must bind th%PNA-CNA hybrid with a sufficiently low avidity that upon 
washing the TBA-TN A-PNA complex, the PN A-CNA hybrid is displaced and the 
PNA-TNA hybrid is not displaced; 

(b) The TBA must avidly bind the TBR(s) created by the hybridization of the TNA 
with the PNA. Binding affmities in the range of lO"^ to about 10'^^ or higher are 
generally considered sufficient As noted below, in some instances, it might be 
desirable to utilize a particular TBA which has a very low avidity for a particular 
TBR, but which has a greatly increased affinity when a particular configuration of 
multiple TBRs is provided so that the square of the affinity of the TBA for each 
TBR becomes the affinity of relevance to that particular TBA. 

Exarqjles of the DNA binding components usefol in the formation of TBAs include, but are 
not limited to NF-kB, papillomavirus E2 protein, transcription factor SPl, inactive restriction 
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enqroes, antibodies, etc. Each of tbese proteins has been recognized in the art to contain sequences 
which bind to particular nucleic acid sequences and the affinities of these interactions are known. 
Naturally, tie method of die instant invention is not limited to the use of these known DNA binding 
proteins or fragments thereof. From the instant disclosure, it would be apparent to one of ordinary 
ddn that the instant method could easily be applied to the use of novel TB As exhibiting at least the 
required attributes noted above. Thus, for example, in WO 92/20698, a sequence specific DNA 
binding molecule comprising an oligonucleotide conjugate formed by the covalent attachment of a 
DNA binding drug to a triplex forming oligonucleotide was desmbed. The method of that 
disclosure could be used to produce novel TBAs for use according to the instant disclosure, provided 
thattheTBAs thus fomiedme^tbe criteria describedabove. In addition, the methods of U.S. Patent 
Nos. 5,096,815, 5,198,346, and WO88/D6601, herein incorporated by reference, may be used to 
generate novel TBAs far use according to the method of this invention. Specific antibodies or 
portions thereof could be used (see for example Blais 1994). 

Where the TBA is a protein, or a con^lex of proteins, it will be recognized that ai^ of a 
number of methods routine in the art may be used to produce the TBA The TBA m^ be isolated 
from its naturally occurring environment in nature, or if this is impractical, produced by the standard 
techniques of molecular biology. Thus, using NF-kB as an exanq)le, using die DNA binding 
pordons of p50 CH'p65 subunits, this binding assembly could be produced according to recombinant 
methods known in the art (see for cxan?>le Ghosh [1990] Cell 62: 1019-1029, describing the cloning 
of the p50 DNA binding subunit of NF-kB and the homology of that protein to rel and dorsal). 

Msny DNA and other nucleic acid binding proteins are known which can be used as or in 
TBAs according to this invention. Once the ammo add sequence of any DNA, RNA:DNA, RNA 
OT other nucleic 2tcid binding protein is known, an appropriate DNA sequence encoding the protein 
can either be prepared by synttetic means, or a cDNA copy of the mRNA encoding the protein from 
an appropriate tissue source can be us«L Furthermore, genomic copies encoding the protein may 
be obtained and introns spliced out accordmg to methods known in the art. Furthermore, the TBAs 
may be chemically synthesized. 

Once an appropriate coding sequence has been obtained, site-directed mutagenesis may be 
izsed to alter the amino acid sequence encoded to produce mutant nucleic acid binding proteins 
exhibiting more desirable binding characteristics than those of the original nucleic acid binding 
protein. As an example of this process, the amino acid sequence of the DNA binding portions of 
NF-kB can be altered so as to produce an NF-kB' molecule which more tightly bmds the NF-kB 
binding site (see examples below - HIV-Detect and HIV-Lock). 
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To iTOvide forflier insight into this aspect of the invention, the foiiowing considerations are 
tobenoted Using W-kB as an example, a TBA may be prepared using the nato 
kB molecule. However, because this molecule is present in vanishingly small quantities in cells, and 
because the subunits of this DNA binding protein have been cloned, it would be more reason^le to 
5 prepare large quantities of the complex via recombinant DNA means as has already been 

accomphshedforthis protein (see forexample Ghosh [1990] Ce// 62:1019-1^^^ NF-kB is a 
pldolropic inducer of genes involved in immune, inflammatory and growth regulatory responses to 
primary pathogenic (viral, bacterial or stress) challenges or secondary pathogenic (inflammatory 
cytddne) challenges. NF-kB is a dimmc DNA binding protein comprising a p50 and a p65 subunit, 
10 both of\si!ich contact and bind to specific DNA sequences. In an mactivated state, NF-kB resides 

in the ceUular cytoplasm, complexed with a specific inhibitor, I-kB, to form a cytoplasmic 
heterotrimer. Upon acdvation, the inhibitor is decompiexed, and the p50-p55 dimer relocates via 
a specific nuclear localization signal (NLS) to the cell's nucleus whore it can bmd DNA and effect 
its role as a transcriptional activates: of numerous genes (see Grimm and Baeuerle [1993] Biochem. 
15 1 290:297-308, for a review of the state of the art regarding NF-kB). 

The p50-p65 dimer binds with picomolar affinity to sequences matching the consensus 
GGGAMTNYCC (SEQ ID NO. 117), with sli^tly different affinities pending on the exact 
sequence. It is worth noting that homodimers of p50 and p65 have also been observed to occur. 
TiKse homodimOT display different biochemical properties as well as sli^tly different affinities of 
20 binding sequences within and similar to the above consensus. Thus, depending on the desired 
binding characteristics of the TBA, a p50-p65 heterodimer, a p50-p50 homodimer, or a p65-p65 
homodimer or firagments of the aforementioned dimers may be used 

One w^ in \rfjich various novel TB As may be produced for use according to this invention 
is shown schematically in Figure 9. The nucleic add recognition units of the TBA may be assembled 
25 and associated with similar c»: dissimilar TBA nucleic acid recognition units via a "chaperone." The 
diaperone is a structure on which the various TBA recognition elements are built and which confers 
desirable properties on the nucleic acid recognition units. The chaperone is comprised of any 
sequence which provides assembly sequences such that same or different nucleic add recognition 
units are brought into close and stable association with each other. Thus, for example, in the case 
30 of a TBA designed to tightly bind NF-kB TBRs, a TBA is assembled by providing lambda cro 
seqiCTces as assembly sequerK:es, linked to the nucleic acid binding sequences for cither NF-kB p50 
or p65. The p50 orp65 nucleic acid binding sequences are linked to the cro sequences at either the 
carboxy or amino terminus of cro and either the carboxy or amino terminus of the nucleic acid 
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recognition unit of the p50 or p65. Linking sequences are optionally provided to aliow appropriate 
spacing of the imclcic acid recognition units for optimal TBR binding. 

The assembly seqisnces, exemplified above by cro and CI sequ^ces (SEQ K> NOS. 104- 
108), comprise aiQrstabkoHgopeptideswMchnaturaUy and strongly bond to Thus, 
in the case of cw, it is well known that a dimer of cro binds to the bacteriophage lambda operator 
sites (Anderson al [1981] Uatun 290:754-758; Hairison and Aggarwal [1990] Am, Rev, 
Biochem, 59:933-969). The monomer units of cro tightly and specifically associate with e^ other. 
Thus, by linking DNA recognition unit sequences to the cro sequences, close and tight association 
is achieved. 

The cptiooal linker sequences comprise any amino acid sequence \^ch does not mterfere 
with TB A assembfy or nucleic acid binding, and which is not labile so as to Uberate the nucleic acid 
recognitioii unit ficm the complete TBA. It is desirable but not necessary that the linker sequences 
be covalently linked to other binding assembly components. The association should be specific so 
as to md in the assembly and manufacture of the binding assemblies. Examples of such sequences 
include, but arc not limited to, such well known sequences as are found linking various domains in 
structural proteins. Thus, for example, in the lambda repressor protein, there is a linking sequKice 
between the DNA binding domain and the dimerization domain which is useful for this purpose. 
Many other such sequences are known and the precise sequence thereof is not critical to this 
imrentico,pK>vided that routine experimentation is conducted to ensure stabiUty and non-interference 
with target nucleic ^d binding. Examples of such sequences are provided herein as Met Ser and 
SEQ IN NOS. 99-102. Insertion of specific, known proteolysis sites into these linkers is also an 
integral part of this invention. The presence of such sites in the linker sequences would provide 
manufecturing advantages, allowing different molecules to be assembled on the chaperone scaffold. 

in addition to the nucleic add recognition units, optional linking sequaices, and assembly 
sequences, the novel TBAs of this invration optionally have asymmetry or PILOT TNA sequences 
and one more OSA units. The asynaraetiy sequences are provided to encourage or prevail certa^ 
desirable or undesirable associations. For example, m the event that a TBA having homodimeric 
p50 DNA recognitioa units is desired, the a^imnetiy sequences are provided to disrupt the naturally 
stronger association of NF-kB p50 subunits and p65 subunits, while not disrupting the assembly 
sequences firom bringing together p50 subunits. Examples of such sequences are provided h^in 
asSEQIDNOS.85.92andSEQIDNOS. 105andl06, 

In a different configuration, NF-kB p50 subunit sequences are brou^ mto close association 
with transcription factCM* SP 1 DNA recognition unit sequences. This is desirable m the event that 
an NF-kB/SPl binding motif is of significance, as in the HIV LTR where a motif of at least six DNA 
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binding protein recognitioa sites, two NF-kB, three S? 1, and a TATA site are known to exist. Since 
it is ako known that the second NF-kB and first SPl site are significant to regulation of HIV 
transcription (Perkins «r i?/. [1993] Embol 12:3551-3558), tiiis particular configuration of TBA 
is uscfiii not onfy in the detectioii of HIV, but as a therapeutic or prophylactic against HIV infection 

5 (see below). In a sunilar fiisMon, the long control region (LCR) of human papillomavirus may be 
used as a key control region for probing according to this method 

In view of the different elements that can be associated, cassette fashion, according to this 
method of TBA formation, an essentially unlimited varied of TBAs are produced In Figure 10, 
a series of different molecules, referred to as "HIV-detect I-IV' are exemplified wherein "CHAP" 

0 denotes the cfc^one, "nfkb*' denotes NF-kB subunits, '^spl*' denotes the nucleic acid recognition 
imit of the SPl transcription factor, and "TATA" denotes a dimer of the DNA recognition unit of 
a TATA sequence DNA binding protein (TBP), also known as a TATA binding protein, or TBP. 
These configiffadons are fcrther exen^^lified below and are all integral parts of the instant invention. 
In y^ another configuradon, the modular structure shown in Figure 9 is adapted to detection 

5 and or treatment or prc?)hylaxis of a canpletelydiff^ In Figure 11, in a similar fashion 

to the above described "HIV-detect I-IV" molecules, a sedcs of '*HPV-Detect I-IV" nsolecules is 
produced In this onbodiment, advantage is taken of the DNA binding properties of the E2 protein 
of human papillomavirus (HPV), In addition, the roles of SPl and TBP are taken advantage of by 
providing specific DNA recognition units adapted to bind to these sequences in the HPV genome. 

0 In the fonnation of the BZ-spcdHc TBAs for use in detecting HPV infection, it may be desirable to 

use any of SEQ ID NOS. 75-84 or 93-98 as the E2 DNA recognition units, A TBA containing a 
bovine E2 dimer and a human E2 dimer DNA binding domain ms^ be particularly useful. 

The various sequences described above may either be chanicaUy linked using pure 
oligopeptide starting mataials, or they may be linked through provision of reccsnbinant nucleic acids 

5 encoding, via the well known genetic code^ the various subelements. In the event of recombinant 

production, linking cro coding sequences to sequence of nucleic acid recognition units to fonn 
TBAs is advantageous because not only does cro act as assembly sequences in the chap^one, it also 
acts to direct the proper folding of the nucleic acid recognition clemaits. Exemplary sequences fc»r 
chaperones are provided herein as SEQ ID NOS, 104-108. Furthermore, in the event that higher 

0 order structures comprising multiple binding sites is desired, as in a pentameric NF-kB/NF- 
kB/SPl/SPl/SPl TBA, proper design of the asynanetry sequences allows such structures to be 
made. 

In the foregoing fashion, TBAs are prepared which bind t» their cognate binding sites with 
high affinity. For ejcample, the NF-kB DNA binding components of the TBAs of Figure 10 are 
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expected to bind to the HIV-LTR\rith an afiOmty of between about 1^ lO'^^molar. Sequences 
useful as the DNA recognition units are provided as SEQ ID NOS. 63-71, 73-84, 93-98, and 104- 
108 and exemplified further below. 

In view of the forgoing description of directed assembly of nucleic acid binding proteins 

5 using assembly and asymmetry (or piloting) sequences, those skilled in the art vn!l recognize that 
a generdfy^licablem^od for assembling protein structures is provided by thi^ The 
generality of this mcAhod is demmstrated fiarflier by consideration, by way of further example, of the 
use of an antibod^-epitope interaction in the assembly of desired structures. By w^ of specificity, 
aDNA bindir^iKOteinstrudurem^ be assembled by linking an NF-kB p50 subunit to an antigen, 

10 such as a drcuiarized (throi^ disulfide bonds) melanocyte stimulating hormone (MSH). This pro- 
MSH molecule m2^ then be bound by an anti-MSH antibody to provide a novel nucleic acid binding 
assembly, with the antigaa and antibody acting as assembly sequences. 

The modular structure provided by Figure 9 reveals that a great variety of TBAs may be 
assembled using different COTibinations of components. Thus, representative anbodiments of this 

15 general structure are provided as SEQ ID NOS. 109-1 16. 

7. The Booster Binding Assemblies fBBAs^ and their preparation . A BBA may be any 
substance which binds a particular BBR formed by hybridization of particular PNAs and BNAs, 
including when multiple BNAs (up to and including "n" BNAs, i.e., BNA^ , wherein "n" is 
theoretically 0-co, but practically is betu'een about 0 and 100) are polymerized onto the PNA for 

20 signal amplification, provided that the BBA must have at least the foUowmg attributes: 

(a) The BBA must bind the BBRs in a fashion that is highly specific to the BBR of 
interest. Thai is, the BBA must discriminate between BBRs present in tiie PNA- 
BNA hyteid and similar duplex sequences in BNA-CNA hybrids or other CNAs. 
Thus, where even a single base mismatch or conformational differences with or 

25 without base mismatches occur in the production of the PNA-BNA„ or PNA- 

BNAjfHNA hybrid, tiie BBA must bind the hybrid mih a suflQcicntly low avidity 
that ip^a washing the TBA-ThJA-PNA-BNA^con^jlex, the BBA is displaced from 
the CNA sequences but not the BBR sequences. 

(b) The BBA must avidly bind the BBR(s). Binding affinities in the range of 1 0'^ to 
30 about 1 0^^ or higher are generally considered sufficient. 

Examples of BBAs include, but are not limited to cro, and the bacteriophage lambda 
repressor prc^in, Q. In addition, see U.S. Patent No. 4,556,643, herein incorporated by reference, 
which suggests other DNA sequences and specific binding proteins such as repressors, histones, 
DNA modifying enzymes, and catabolite gene activator proteux. See also EP 0 453 301, herein 
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incorporated by reference, which suggests a mnltitade of nucleotide sequence specific binding 
protdns (NSSBPs) such as flie tetracycline rq)ressar, the lac repressor, and the tryptophan repressor. 
Each of these BBAs has been recognized in the art to bind to particular, known nucldc add 
sequences and the affinities of these interactions are known. Naturally, the method of the instant 

5 invention is not limited to the use of these known BBAs, From the instant disclosure, one of 

ordinary skill could easily apply the use of novel BBAs exhibiting at least the required attributes 
noted above to the instant method. 

Examples of novel BBAs usefid according to this aspect of the invention include novel 
proteins based on tte motif of a known DNA or RNA or DNA:RNA binding protein such as cro or 

10 the A. CI repressorprotein. Preferably, such modifications are made to improve the handUng of these 
components of the invention. Thus, it may be desirable to add a high concentration of cro to an 
ass^. One of the negative qualities of cro is that at hi^ concentrations, the binding of cro to its 
DNA target comes into conqjetitiwi with cro-cro interactions. Thus, for example, a chapo'oned or 
mutated cro may be produced which does not have this shortcoming. Exan[q)les of such alta^ 

15 ch^)aones are SEQ ID NOS. 105-106 and 108, Methods known in the art, such as production of 

novel target binding proteins using variegated populations of nucleic acids and sdec^on of 
bacteriophage binding to particular, pre-selected targets (/.e., so-called phage-display technology, 
see discussion above for production of novel TBAs) m^ be used to produce such novd BBAs as 
well as the afor^nentioned novd TBAs. 

20 Where the BBA is a protein, or a complex of proteins, it will be recognized that any of a 

number of methods routine in the art may be used to produce the BBA. The BBA may be isolated 
from its naturally occurrii^ ^vironment in nature, or if this is impractical, produced by the standard 
techniques of molecular biology. Thus, for example, the sequence of the cro protein is known and 
any molecular clone of bacteriophage lambda may be used to obtain appropriate nucldc acids 

25 encoding cro for recombinant production thereof In addition, the TBAs described herein may be 

used as BBAs, provided that different TBAs are used to bmd TBRs and BBRs, 

8. The use of BBAs and BBRs to localize and amplify the localization of the PNA-TNA- 
TBA complexes (see Figure 8). hi one embodiment of this invention, the highly specific and 
extreme^ tight binding of TBAs comprised of nucleic acid binding components is used to produce 

30 an amplifiable nucleic acid sandwich assay. According to one aspect of this embodiment, a solid 
siq)port is coated with a first TBA creating an immobihzed TBA, hi solution, a PNA and TNA are 
ccHJtacted under hybridiang conditions and then contacted with the immobilized TBA. Only those 
PNA-TNA interactions which form the specific TBR recognized by the immobilized TBA are 
retained upon wash-out of the solid surface which binds the TBA-TBR complex. 
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Detection of the bound TBR is accomplished through binding of Booster Nucleic Acids, 
BNAs, to the 1/2 BBRs pr^ent on the PNAs under l^bridizing conditions. In this manner, even if 
only a single TBA-TBR complex is bound to the immobilized TBA, a large, amplified signal may 
be produced by polymerizing multiple BNAs to the inmobiUzedTN Each BNA which binds to 

5 theTNA forais aBBR whichcanbeboundbyBBAs vMch, like the TBAs immobilized cai the soHd 
surface, may be chosen far their very tight and specific binding to particular nucleic acid structures . 
Thus, according to this embodiment, die immobilized TBA may contain the DNA binding portion 
of NF-kB, which very specificaDy and tightiy bmds to NF-kB bmding sites formed xspcm 
hybridization of the TNA and PNA to form suA a site. 

10 Because it is weU known that there are NF-kB binding sites botii in the normal human 

genome and m the long temml repeats of human immunodeficiency virus (HIV), this invention 
provides a mefliod of discriminating b^een the *'normal" human sites and tiie sites present in cells 
due to HIV infection. Therefore, in a test designed to determine liie presence or absence of HIV 
DNA in a san^le of human Dl^ die HTV NF-kB binding sites may be view^ 

15 normal human NF-kB binding sites may be viewed as CNAs. According to tiie method of this 
invention, discrimination between these TNAs and CNAs is accomphshed by takii^ advantage of 
the fact that in the HIV LTR, there are twoNF-kB binding sites, followed by tiiree SPI sit^ (see, for 
example,Kokenefa/. [1992] Rro/o^y 191:968-972), while ceUularW-kB binding sites with the 

same sequences are not found in tandem. 

20 In cases vhere the TNA contains more than one 1/2 TBR and it is desirable to pursue the 

tiierapeutic and prophylactic apphcations of the TBAs, it may be desirable to use more than one 
TBA, each with the edacity to bind a TBR in the TNA-PNA complex. In this case, it may be 
advantageous to select, as components of the TBAs, DNA-binding or RNA-binding domains with 
lesso-affinilyfOTitsTBR&anthewild-typeDNA.bindingorRN^ Giventiiatthe 

25 TBAs which are involved in the binding to the multiple TBRs can eifljc^ assemble together befiwe 
binding to tiieir TBRs or assemble together after binding to tiieir TBRs, the individual TBAs will 
not block the cmespoirfing TBRs in the oflior genomes than tiie target genome unless tiie TBRs are 
spatiaify enable of binding flie assembled TBA complex. One feature of tiie multimeric assembly 
of TBAs which is specifically claimed here as part of tiiis invention is tiiat such a multimmc 

30 assembly is e?q)ected to have a much reduced affinity for a single site within the TNA. However, 

since flie binding is dramatically mcreased relative to any one TBA, die TBA complex would be 
expected to not compete for die binding of any single TBR widi die corresponding native proteins 
in situ but bmd tightiy to sequences in die PNA-TNA hybrid containing die TBRs for eadi of die 
nucleic acid-binding components assembled in die TBA. The TBA complec should be assembled 
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and Mm adjusted in the individual TB As so as to allow the nucleic acid-binding regions contained 
in the TBA complex to simultaneously reach and bind to these targets. 

Once the TNA-PNA hybrids have formed and been contacted with die immobilized TBA, 
unbound nucldc add is washed from the immobilized surface and die immobilized hybrids detected. 
5 This is accQn^lished in any me of several ways. In one aspect of this invention, the PNA is labeled 
with an OSA, such as a radionuclide, colored beads, or an enzyme capable of forming a colored 
reaction product. Furthermore, in addition to having one or more 1/2 TBRs, the PNA also may 
contain at least one 1/2 BBR, Thel/2BBR sequences are chosen so as to be con5)lcmentaxy to 
unique 1/2 BBR sequences in BNAs. hi the embodnncnt described above, for example, where the 
10 TBA is NF-kB and the TBR formed upon TNA-PNA hybridization is one or more NF-kB binding 
sites, the 1/2 BBRs may provide hybridizable (that is, single-stranded, complementary) sequences 
of the left or right bacteriophage lambda operators (see, for example, Ptashne [1982] Scientific 
American 247:128-140, and references cited therein for sequences of these operators). These may 
be polymerized onto the PNA 1/2 BBRs in a vectorial fashion (see Figures 2 and 3) providing up 
15 to "n" BBRs, and each BBR forms a cro bmding site, fiizymatically, radioactively, cm- otherwise 
labeled cro, is contacted with the TBA.TNA-PNA-{BNA)^ complex, hi this Miion, a highly 
selective and an^lified signal is produced. Signal produced using a PNA having a single 1/2 TBR 
indicates success of the assay in achieving TBA-TBR bindmg and polymerization of the BNAs to 
produce signal j&om cellular sites (i.e, from CNAs). Absence of sigial ^en a dimcrized TBA is 
20 used mdicates that in the TNA, there were no HIV LTRs as no double NF-kB binding sites were 
preset Qntheothcrhanipresenceofagnal usingthedimer NF-kB indicates HIV As 
a specific example of the foregoing description of this embodiment of the invention, see Example 
6 describing an HIV test kit 

Naturally, those skilled in the art will recognize that the foregoing description is subject to 
25 several modifications in the choice of PNAs, TNAs, TBAs, BNAs, and BBAs. Furthermore, in 
systems other than HIV, those skilled in the art will recognize that the general method described 
above could be likewise applied. However, these other appHcations may be simpler than the above 
described n^fliod as Ihe TB As used ma^ noi recognize any normal cellular s ites and therefore resort 
to dimerization or other methods of discriminating between TNAs and CNAs may be less critical. 
30 in designing probes and binding ass^bUes for these oto systems, the skilled artisan will be guided 

by the following principles and considerations. 

In Ihe above-desoibed embodiment, the appeal of using the DNA-binding portions of NF- 
kB protein as the TBA and the NF-kB recognition binding elements as the TBRs is that these 
ekments f<xm an io^rtant "control poinf for the rephcation of HIV. That is, it is known that HIV 
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is required to use NF-kB as a critical feature in its replicative life cycle. Similar control points for 
other pathogens are chosen and used as a basis for detection according to the methods described 

From the foregoing description of general features of this invention and the mode of its 
5 operation, one skiUed in the art will recognize that there are a multiplicity of spedfic modes for 

practidngthisinveDtioa By of example, the method of this invention is adaptable to a method 
and devices using dirumatogr^hic test kits described in U.S. Patent Nos. 4,690,691 and 5,3 10,650 
(the '691 and '650 patents). In those patents, a porous medium was used to immobilize either a TNA 
or a capture probe, and a solvent was used to transport a mobile phase containing either a labeled 
10 PNA, if the TNA was immobilized, or the TNA, if a capture probe was immobilized, into the 
"capture zone." OiKe the TNA was bound in the capture zone, either by directly immobilizing it or 
through c^ture, a labeled PNA was chrmiatp^^hed through the capture zone and any bound label 
was detected. 

Adapting the instant inventim to sudi a s>«tem provides the improvement of using a Target 

15 Binding Assembly in the capture zone and therefore, the capture of only perfectly matched TBR 

sequences or other TBRs representing nucleic add confirmations siKscifically bound by the TBA 
within tte TN A-PNA dupleces by virtue of the previously described sensitive discrimination by the 
TBA between TNAs and CNAs. 

Once the TNA-PNA b^iids become bound to the immc*ilized TBA, the signal is amplified 

20 by adding BNAs or chromatographing BNAs through the capture ^ne. FinaUy, the signal may be 
further amplified by adding BBAs or chromatographing labeled BBAs througji the capture zone. 
In this fashion, the case of performing the analysis steps described in the *69i and '650 patents is 
improved upon herem by providing the additional ability to increase the specificity and, through 
amplification, the sensitivity of the method described in those patents. The disclosure of the '691 

25 and '650 patents is herein incorporated by reference for the purpose of showing the details of that 
method and for the teachings provided therein of specific operating conditions to which the 
cxxmpositions and methods of the instant invention are adaptable. 

Those skilled in the art will also recognize that the method of the instant mvention is 
amenable to bang run in microtiter plates or to automation The use of machines incorporating the 

30 method of this invention therefore naturally falls within the scope of the instant disclosure and the 
claims upended hereto. Thus, for example, this invention is adaptable for use in such instruments 
as Abbott Laboratori^* (Abbott Park, IL) IMx tabletop analyzer. The IMx is currently designed to 
run both fluorescent polarization immunoassay (FPZA, see Kier [1983] KCLA 3:13-15) and 
microparticle enzyme immunoassay (MEZA, see Laboratory Medicine, Vol. 20, No. 1, January 
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1989, pp. 47-49). The MEZA method is easily transfonned into a nucleic acid detection method 
using the instant invention by using a TBA as a capture molecule coated onto a submicron (<0.5 ^m 
on average) sized microparticle suspended in solution. The microparticies coated wth TBA are 
pip^ted into a reactitxi cell. ITie IMx thai pip^es sample (hybridized PN A-TNA) into the reaction 
5 cell) fonning a complex with the TBA, After an appropriate incubation period, the solution is 
transferred to an inert glass fiber matrix for which the particles have a strong afSnily and to which 
the microparticies adhere. Either prior to or after filtering the reaction mixture througji the glass 
fiber matrix, BNAs and BB As are added, or another signal amplification and detection means is used 
\^ch depends on specific fomiation of TNA-PNA Irybrids. The immobilized complex is washed 

10 and the unbound material flows through the glass fiber matrix. 

The bound complexes are detected by means of alkaline phosphatase labeled BBAs or 
otherwise (radioactivety^, enzymatically, fiuorescently) labeled BBAs. hi the case of alkaline 
phosphatase labeled BBAs, ibt Hwcosceaat substrate 4-mediyl umbellifeiyl phosphate or like reagent 
may be added Alternatively, the enzyme may be bypassed by directly labeling BBAs with this or 

15 a like reagent In any event, fluorescence or other signal is proportional to the amount of PNA-TNA 
hybrids present 

The fluorescence is detected on the surface of the matrix by means of a firont surface 
flucHxmietier as described by tte manufacturer of the IN^ With minor adjustments that can be made 
through routine experimentation to optimize an instrument such as the IMx for nucleic acid 

20 hybridization and nucleic acid-TBA interactions, the instant invention is completely adaptable to 
automated analyses of TNA samples. 

9. OfeerdiapK?stic applications of this invention . While the foregoing descripticai enables 
the use of the instant invention in a number of different modes, many additional utilities of this 
invention are readily appreciated, for example, in a mobility retardation system 

25 In this emboduncnt of the invention, an improvement of the well known electrc^horetic 

mobility shift assj^ (EMSA) is conducted as follows (See Figures 12a and 12b): 

A sample of DNA is fi*agmented, either through random cleavage or through specific 
xesfirictionendonudease treatment The DNA in the sample is then split into two equal ahquots and 
a specific TNA is added to the first aliquot but not to the second. The first and second aliquot are 

30 then electrophoresed in an aaylamide or agarose gel, and the pattern of DNA bands (cither 

visualized through ethidium bromide binding or througji being radioactively labeled prior to 
electrophoresis is then compared for the two aliquots. Fragments of DNA having binding sites to 
which the TBA is specific are retarded in their migration throu^ the electrophoretic medium By 
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using an appropriate TBA^ ar^ number of DNA or other nucleic acid sequences may be tracked in 
this fashion. 

In a modificadon of the EMSA described above, fragmemed TNA is hybridized with a PNA 
andfra±ionatedinafir^<fimensioa The fi^ocaled DNA is then reacted wth an appropriate TBA 
5 and the change in nubility of the DNA fra^eots is noted Bohancement of the retardation is 
possible by adding BBAs as described above. (See, for e?can?)le, Vijg and references cited therein 
for known techniques of two (2) dimensional nulceic acid electrophoresis, to which the instant 
method may be applied). 

10. Therapeutic applications . Because ofthe very tight and selective nucleic acid binding 

10 characteristics ofthe novel TBAs described herein, therapeutic utilities are contemplated in addition 
to the diagnostic utilities of these compounds. Thus, a TEA conqmsing tight and specific binding 
for the HIV-LTR, by virtue of having an NF-kB p50 and an SPl DNA recognition unit in close 
association (see Figure 10, HIV-Detect 11) is useful to bind up the HIV-LTR and thereby prevent 
transcription from this key element ofthe HIV genome. The unique features of the assembly 

1 5 sequences of the TBA allow recombinant vectcMS to introduce DNA encoding such a TB A into a ceil 

and the proper folding ofthe expressed sequences. Once inside the cell, the nuclear localization 
signals ofthe p50 subunit directs the transport ofthe TBA to the nucleus where it binds tightly to 
the LTR of any integrated HIV, effecdvefy shutting the pathogen down. In a prophylactic mode, one 
that is concerned about potential HIV exposure is administered a sufficient dose of a TBA or a 

20 recombinant vector able to express the TBA, so as to lock up any HIV that might have entered the 

person. In this mode, the use ofthe TBA is analogous to passive protection with a specific immune 
globulin. In the therapeutic or prophylactic mode, NLS sequences are used in place ofthe OSAs 
used in the diagnostic mode. Exemplary NLS sequences are provided as SEQ ID NOS. 72 and 103 
(see also Heinzinger 1994 and Bukinsky 1993, describing NLS sequences ofthe HIV Vpr and gag 

25 proteins respectively). In any event, the TBA is administered in a phannaceutically-acceptable 
earner, known in the art such as a sterile salt solution or associated with a liposome or in the foim 
of a recombinant vector, preferably one which directs expression ofthe TBA in a chosen cell type, 
or by a protein dehvery system. 
D. Embodiments of the Invention 

30 In view ofthe fonegoing description and the examples which follow^ those skilled in the art 

will appreciate that this disclosure describes and enables various ^bodiments of this invention, 
including : 
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1. A probe nucleic acid (PNA) comprising: 

(a) a single-stranded sequence, 1/2 TBR, which is capable of forming, under 
l^rbridizing conditions, a hybrid, TBR, with a 1/2 TBR present in a target nucleic 
acid(TNA); 

(b) zero, one or more, and preferably one to ten single stranded sequences, 1/2 BBR, 
which is enable of forming, under hybridizing conditions, a hybrid BBR, with a 
1/2 BBR preset in a booster nucleic acid (BNA); and 

(c) an OSA, which is no attadied support and/or indicator, or an attached support or 
other means of localization, including, but not limited to, attachment to beads, 
polymers, and surfeccs, and/or indicators; 

wherein said TBR is capable of binding with high aflBnity to a TBA, said TBA being a substance 
capable of discriminating between a paired TBR and a TBR having wpmd nucleotides, and 
further, wherein said BBR is capable of binding with high afSnity to a BBA, smd BBA being a 
substance cs^pable of discriminating between a paired BBR and a BBR having mpmtd micleotides. 
This onbodiment includes TBRs which are nucleic acid binding protein recognition sites, such as 
the HTV LTR, and other nucleic acid binding protein recognition sites in other pathogens, some of 
vMch are noted above. The PNA of this embodiment of the invention may produce a TBR vMch 
is a nuddc add binding protdn recognidan site present in the genome of a pathogen or is a binding 
site associated with a pathogenic condition in the human genome or a contaminant in a fermentation 
process. 

1 A booster nucleic acid (BNA) comprising: 

(a) a 1/2 BBR which has a sequence which is complementary to a 1/2 BBR sequence 
in a PNA or another BNA already hybridized to the PNA and which is capable of 
fcffming, under hybridizing conditions, a hybrid, BBR, with the PNA; 

(b) an OS A attedied support or other means of localization, including, but not Kmited 
to, attachment to beads, polymere, and surfaces, and/or indicators; and 

(c) additional hybridization sites, 1/2 BBRs, for hybridization with additional BNAs; 
wherein said BBR is capable of binding witii high affinity to a BBA, said BBA being a substance 
capable of discriminating between a paired BBR and a BBR having unpaired nucleotides. 

3. A Hairpin Nucleic Add (HNA) coai^rising a single-stranded sequence, 1/2 BBR, which 
under hybridizing conditions is capable of formmg a hairpin vMs at the same time binding to a 
BNA to fonn a BBR capable of binding a BBA, wherein said BBR is capable of bindii^ witin high 
afBaity to a BBA, said BBA being a substance capable of discriminating between a paired BBR and 
a BBR having unpaired nucleotides. 
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4. A method for detecting a specific TNA sequence, comprising the steps of: 

(a) hybridizing said TNA with a PNA as described above; 

(b) hybridizing said PNA with a BNA containing a 1/2 BBR whose sequence is 
complemsntsary to a 1/2 BBR sequence in the PNA; 

5 (c) aMng the products ofsteps (a) and (b) containing a TBR and a BBR, to a sur^^ 

liquid or otfaa* medium containing a TBA; 

(d) adding BBAs to the mixture in step (c) wh^ein said BBA comprises: 

(i) a molecule or a pc«tion of a molecule which is capable of selectively 
binding to a BBR; and 
10 (ii) a detectible indicator, 

and 

(e) detectmg signal produced by the indicator attached to the BBA. This method 
includes the use of a protein indicator, including enzymes capable of catalyzing 
reactions leading to production of colored reaction products. It also includes 

15 indicators such as a radionuclide or colored beads. 

5. AmeflKxifordetectinglhepiesenceinasan^jleofaspecific Target Nucleic Acid, TNA, 
\rfuch comprises: 

(a) contacting said sanq>le with a Probe Nucleic Acid, PNA, which, upon hybridizaticMi 
with said TNA if present in said sample, forms a Target Binding Region, TBR, 

20 vMdi is capable of binding a Target Binding Assembly, TBA; 

(b) contacting said sample, abea^ in contact with said PNA, with a TBA capable of 
binding to any TBRs foraaed by the hybridization of said PNA and said TNA in the 
sample. 

6. A nKtiiod for detecting or localizing specific nucleic acid sequences with aMgh degree 
25 of sensitivity and specificity which comprises: 

(a) adding PNAs containing a 1/2 BBR and a 1/2 TBR to a sample containing or 
suspected of containing TNAs containing 1/2 TBR sequences, to fonn a complex 
having target binding regions, TBRs, formed by the hybridization of 
completnentary 1/2 TBRs present in the PNAs and TNAs respectively; 
30 (b) binding the TBRs formed in 5tq) (a) to an immobilized TBA to fonn a TBA-TNA- 

PNA complex; 

(c) adding Booster Nucleic Adds, BNAs, containing booster binding regions, 1/2 
BBRs, to the complex form^ in steap (b) such that the 1/2 BBRs in the BNAs 
l^bridize withAe 1/2 BBRse<piecices present in the PNAs or to 1/2 BBRs present 
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in BNAs already bound to the PNA, to form BBRs, such that TBA-TNA-PNA- 
(BNA)^ complexes are formed; 

(d) adding Haiipin Nucleic Acids, HNAs, ccntaining 1/2 BBR sequences, to the 
c<si5)lexfomied in stq) (c) such that the 1/2 BBRs in the HNAs hybridize with any 
available 1/2 BBR sequences present in the BNAs of the complex of step (c), 
thereby capping the extension of the BNAs onto the TBA-TNA-PNA-{BNA)„ 
complexes of step (c) to form TBA-TNA-PNA-(BNA)„.HNA complexes; 

(e) adding Booster Binding Asscmbli^, BBAs» linked to indicator moieties, to the 
TBA'TNA-PNA-(BNA)„-HNA coo^Iexes fonned in step (d) to form TBA-TNA- 
PNA-(BNA-BBA),-HNA complexes; and 

(f) ejecting the signals prodiK»d by the indicator moieties linked to the TB As, PN As, 
BNAs, BBAs or HNAs inthe TBA-TNA-PNA-(BNA-BBA)„-HNA complexes of 
step (e); 

wherein: 

the TNA comprises: 

(i) one or more specific 1/2 TBR nucleic acid sequences, the presaice or 

absence of which in a particular sample is to be confirmed; 
the PNA comprise: 

(i) a smgle-stranded sequence, 1/2 TBR, which is capable of forming, under 
Itybridizing cwiditions, a hybrid, TBR, with a 1/2 TBR present in a target 
nucleic acid (TNA); 

(ii) a single stranded sequwice, 1/2 BBR, vAioh is capable of forming, under 
Itybridizing conditions, a hybrid BBR with a 1/2 BBR present in a booster 
nucleic acid (BNA); and 

(iii) an OSA, which is no attached support and/or indicator, or an attached 
support or other means of localization, including, but not limited to, 
attachment to beads, polymers, and surfaces, and/or indicators; 

the BNA comprises: 

(i) a 1/2 BBR, as sboym in Figure l(IIb), which has a sequence which is 
conq>lQnenta!y to a 1/2 BBR sequence in a PNA and which is capable of 
forming, under hybridizing ccaiditions, a hybrid, BBR, with the PNA; 

(ii) an OSA attached siq^port or other naeans of localization, including, but not 
limited to, attachment to te^is, pofymers, and surfaces, and/or indicators; 

(iii) additional hybridization sites, 1/2 BBRs, for other BNAs; and 
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(iv) sequences, 1/2 BBRs, which can hybridize to BNAs already hybridized to 

thePNA; 
the BBA ccMnprises: 

(i) a molecule or a portion of a molecule which is capable of selectively 
5 binding to a BBR; and 

(ii) no attached support and/or indicator, or an attached support or other 
nieans of localization, incltzding, but not limited to, attachment to beads, 
polymers, and surfaces, and/or indicators; 

and the TBA conq>rises: 

10 (i) a molecule or a portion of a molecule which is capable of selectively 

bindii^toaTBR;and 
(ii) no attached support and/or indicator, or an attached support or other 
means of locahzation, including, but not limited to, attachment to beads, 
polymers, and surfaces, and/or indicatcMrs. 
15 7. An improvement to a solid phase hybridization method for detecting the presence of a 

target polynucleotide involving: immobilizing a target polynucleotide, if present in a test saiiq)le, 
directly or via an interasediate capture structure, on a solid phase at a capture site; before, during or 
after said hnmobilization, attaching a detectable label to said target polynucleotide, if present; and 
detecting said label, if any, at said capture site; the improvement conq>rising: 
20 (a) using a Target Binding Assonbly, TBA, as the means for achieving innnobil^ 

of said target polynucleotide, wiierein said TBA binds only to a perfect hybrid 
fcmed between a specific Probe Nucleic Acid, PNA, and said target nucleic acid 
such that a perfect Target Binding Region, TBR, recognizable by said TBA is 
fonned; and 

25 (b) including in the PNA a smgle stranded sequence, 1/2 BBR, capable of binding a 

Booster Nucleic Acid, BNA, caitaining a single stranded con5)lementaiy 1/2 BBR 
which, ijponhybridiz^on with the 1/2 BBR in the PNA, forms a BBR capable of 
binding labeled Booster Binding AssembUcs, BBAs. 
8. A target binding assembly, TBA, comprising one or more nucleic acid recognition units, 
30 linker sequence(s), assembly sequettce(s), asymmetry sequ&acc{s\ nuclear localization signal 

sequence(s) (NLS) and OSA(s). The nucleic acid recognition unit may be an NF-kB binding unit, 
an SPI binding unit, a TATA binding unit, a human papillomavirus binding unit, an HIV LTR 
binding unit, or a binding unit for any other fragment of specific sequence the detecticm of which is 
desirable and which can be achieved through specific association with the TBA. Such recognition 
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units include, but are not limited to those exemplified herein as SEQ ID NO. 63, SEQ JD NO. 64, 
SEQ IDNO. 65, SEQ E>NO. 66, SEQ ID NO. 67, SEQ ©NO. 68, SEQ ID NO. 69, SEQ ID NO. 
70, SEQ ID NO, 71, SEQ ID NO. 72, and SEQ ID NO. 73. Linker sequences such as oligopeptides 
which do not interfere with the nucleic add recognition function of the nucleic acid recognition unit 
5 andvAich provide stdaility and conixol ov^ the spacing of the nucleic acid recognition unit fixan the 

imaindaoftheTBA. Bcanq)lesofsuch linker sequmc^ are weU known in the art an^ 
are not limited to oligopeptide sequences from the interdomain primary sequence of a structoai 
protein. Assembly sequences include oKgopeptide sequences which direct the folding and 
association of nucleic acid recognition units. A preferred example of such sequences are 
10 ohgopeptides derived fiom flie bacteriophage lambda cro protein. The asymmetry sequence directs 
ihe assodatiott of nucleic add recognition and assembly sequences in a predetermined order. Such 
asymmetry sequences are exemplified by sequences derived from insulin, relaxin, gonadotropic 
hcHmone,FSH,HCG,lJI,ACTH,inckdmgbutnotlimitedtoSEQro 85-92. Withreference 
to Figures 14 and 15, SEQ ID NO. S5 is an "A" and SEQ ID NO. 86 is a "B" sequence; SEQ ID 
15 NO. 87 is an "A" and SEQ ID NO. 88 is a "B" sequence' SEQ ID NO. 89 is a human relaxin "A" 
and SEQ ID NO. 90 is a human relaxin "B" sequence; SEQ ID NO. 91 is a dcate relaxin "A" and 
SEQ ID NO. 92 is a skate relaxin "B" sequence, hi addition, the TEA may contain nudear 
localizaticxi signal sequences, NLS, which direct the migration and uptake of a protein or complex 
associated v/i&x said NLS into the nucleus of a cell. Examples of such NLS sequences are provided 
20 as SEQ IDNOS. 72 and 103. Preferred embodiments of tlK; TBA include but are not limited to HIV 

Detect WV or HPV Detect WV, and SEQ ID NOS. 109-1 16. 

9. Methods of using the novel TBAs of this invention include, but are not limited to a 
method of using the TBA to bind a particular nucleic acid sequence in a tar^t nucldc acid sample 
which comprises: 

25 (a) fragmenting the nucleic add in the target nucleic add sample; 

(b) contacting, under b>l>ridizing conditions, the fragmented nudeic acid with a probe 
nucleic acid complementary to the particular nucldc acid sequence of mterest, 
wherein said probe nucleic acid, upon hybridization with said particular nucldc 
add sequence of mterest forms a target binding region to which said TBA 
30 specifically binds. 

In this method, the probe nudeic add, in additiofn to sequences complemoitaiy to said particular 
nucldc add sequence of interest, also nm^ have additional sequences to which a booster nucleic acid 
can bind to form a booster binding site to which a labeled booster binding assembly can bind to 
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provide a signal showing and amplifying the binding of the probe nucleic acid to the target nucleic 
acid sequence of interest. 

An additiaial aspect of this invention not requiring fragmentation of Target Nucleic Acid, 
involves administration of the TBA to a patient in need of such treatment of a thoiapeutically or 
prpphylacticaliy effective amount of said TBA, which comprises administering the TBA, either in 
die f<rm of a purified pictdn conq^lex ch- in fte fcm, of a recombinant vector which, upon entry into 
the patiait is able to e^qsiess the TBA, such that the TBA binds the particular nucleic add sequence 
to adiieve the desired prophyk:tic or fher^)euti^ This may include providmg a dosage ^^h 
can be determined by routine e?q)erimentation to be sufficient to prevent establishment of an active 
infection by a pathogen. Dosages of purified TBAs may be in the range of about 0,00i to 100 
mg/kg. When provided as areccmbinant expression vector which will direct the m vivo es^sression 
and folding of the TBA, dosages of the recombmant nucleic acid may be substantially low^, 
particular^ if provided in the fmn of non-pathogenic viral vector. The methods of using the TBAs 
also include monitoring the shift in mobility of nucleic acids in target nucleic acid samples as a 
fimction of the size sudi that binding of the TBA to a particular fragment in the sample modifies the 
mobility of the fragment This aspect of the method provides a useful method of analyzing nucleic 
acid fragments for particular aberrations, such as might be found associated with metastases. 

10, Diagnostic or fcwreasic kits useful in detenninmg the presence of an infection, the 
susceptibility to a disease, or the origin of a particular nucleic acid containing sample. 

1 L A method of assembling multimeric TBAs in vivo which comprises introducing nucleic 
acids encodmg component TBAs into a cell. The component TBAs should each contain a nucleic 
add recognition unit, assembly sequences, asymmetry sequsices, and nuclear localization signal 
sequences. Lkikst sequences^ optional]^ included if TBA footprinting e?q>eriments indicate the need 
for such linkers to attain q>timal geometry of the multimeric TBA. Upon in vivo expression of each 
compmsQi TBA aiKi pnxdmal binding, via the nucleic acid recognition unit of each corr^nent TBA 
to nucleic acid sequences encountered in the nucleus or elsewhere in the cell, component expressed 
TBAs are directed to assemble via the included assembly and asymmetry sequences into multimeric 
TBAs. As described above, such multimeric TBAs will have the advantage of binding specifically 
with high aJSinity to TBRs in a specific target sequence, but not at all or with very low afflaity to 
cousin nucleic acids. 

The foregoing desoiption of the invention will be appreciated by those skilled in the art to 
enable preferred embodimaits as well as the best mode of this invention. Without limiting the 
subject matter to the specifics of the examples provided heremafter, the following examples are 
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provided to furflier guide those skifled in the art on methods of practicing this invention. Standard 
recombinant DNA techniques as disclosed in Sambrook, Fritsch, and Maniatis (1989) Molecular 
Cloning: A Laboratory Manual, 2nd Ed, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, and more recent texts are not disclosed as these are now weU within the skill of the 
5 ordinaiy artisan. 

Kvample 1 ■ Preparation of PNAs and Labelmg of PNAs 

Probe nucleic adds, PNAs, msy be prepared by means well known in the art. Thus, single 
stranded polynucleotide PNAs of defined sequence may be prq)ared via solid phase chemical 

10 synthesis according to Merrifield PNAs may be prepared by automated synthesis using 
ccsnmerdally available technology, such as resins and machines produced or marketed by Applied 
Biosystems, ABI, ch: otho* manu&cturers. Alternatively, through known recombinant DNA methods, 
particular PNA sequences are synthesized in vivo, for example by clomng a duplex PNA into a 
vector which can replicate in E. cob, large quantities of the duplex PNA may be prepared. 

15 Multimers of tiK PNA may be cloned into the vectOT sudi that for each mole of vector, several moles 

of PNA is liberated xspotx digestion of the vector with a restriction fragment flanking the PNA 
sequwice. Subsequent to synthesis or recombinant production, the PNAs arc purified by methods 
well known in tiie art sudi as by gd dedrophcaesis or hi^ pressure liquid chromatogr^hy (HPLC). 
If the PNA is produced as a diqjlex, prior to use in a hybridization assay for detection of target 

20 nucleic arid sequences, the strands of the PNA are separated by heating or other methods known in 
the art 

The specific sequence of bases in the PNA is chosen to reflect the sequence to be detected 
in a TNA, with the |»oviso that, according to this invention, the PNA contains a 1/2 TBR sequence, 
which is one that vpm hybridization of the PNA and TNA, a TBR is formed. As there are an 
25 essentially unlimited number of such sequ^ices known in the art, the choice of the PNA sequence 
is amenable to sdecticHi by the skiUedreseardierfOTar^^enappUcation, The sequence of the HIV 
LTR is one such sequence, which i^on hybridization of a PNA encoding portions of the LTR with 
TNAs encoding the HIV LTR, TBRs capable of binding the NF-kB or SPl DNA binding proteins 
are formed, 

30 In addition to sequences which will form a TBR upon hybridization, the PNA also may 

contain a 1/2 BBR. This sequence is one which, upon hybridization with a booster nucleic acid, 
BNA, forms a BBR which is capable of binding a BBA. The BBA is preferably a DNA binding 
protein having high affinity for the BBR sequence. 
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M this partiojlaremrple, hybridization betwe^ SEQIDNO, 
4 and, at the 3 ' end of that sequence, a 1/2 BBR sequence shown as SEQ ID NO. 35. The PNA 
encoding these sequences is eite used without labeling or is labeled with a radioactive isotope such 
as S^^ or a similar isotope, according to methods known in the art. Alteroatively, the PNA is 
5 boundtoabeadofbetweenO.Ol to 10^ which may be colored for easy visual detection. This 

label forms the OSA as described in the specificatioa This probe hybridizes with HIV LTR 
sequences to form a TBR that bindsNF-kB. In addition, the PNA hybridizes with BNAs having 
a compiemeotary 1/2 BBR to fonn a bacteriophage lambda left operator that bmds dthcr cro or 
lambda repressor proteins. 

10 Ma manner similarto that (fescribed above, PNAs are used wherein the 1/2 TBR is any one 

of SEQ ID NO. 5 or SEQ ID NOS. 7-34, and a 1/2 BBR, such as SEQ ID NO. 35 or SEQ ID NO. 
36 is either at the 3' end or 5' end of the 1/2 TBR. 

Example 2 - Preparation and Labeling of BNAs 

15 Similar to the mdhods described in Example 1 forinepararion andlabeUngof PNAs,BNAs 

are prepared and labeled according to methods known in the art As described in U.S. Patent No. 
4,556,643, herein incorporated by refer^ce (see particularly Example 1), nucleic acid sequences 
encoding particular nucleic acid binding sequences may be mass produced by cloning into a 
replicable vector. FurthOTiore, similar to that disclosure, the 1/2 TBR and 1/2 BBR sequences may 

20 be co-Unearly produced in this fashion, with the distinction, however, tiiat according to the instant 
invention, the 1/2 TBR sequOTce itself forms a nucleic acid binding component recognition site and 
the 1/2 BBR, while fomiing a miclric add binding comporm recognition site, also provides a means 
of anq>lifying the signal produced vapoa binding of the 1/2 TBR to complementary sequences in tiie 
TNA by providing for polymOTzation of BNAs onto tiie TNA bound PNA. To enable this, a 

25 sequence such as SEQ ID NO, 35, vAich encodes the left operator of bacteriophage lambda, is 

provided with additional sequences such that an overhang sequence is created on one or both ends 
of the BNA upon hybridization with the PNA. 

As a specific example, vectorial polymerization of BNAs onto a TNA is provided by SEQ 
ID NOS. 40-43. In this example, SEQ ID NO. 40 encodes two 1/2 TBRs which will hybridize with 

30 two 1/2 TBRs in a TNA to form two NF-kB binding sites, while at the same time providing a 
bactaiophage lambda left oper^ 1/2 BBR, which additionally is terminated at die 3' end with the 
recognition site f^the restriction en:^e Pstl Addition of the BNA, SEQ ID NO. 41, witii tiie 1/2 
BBRcomplementafy tothe 1/2 BBRon the PNA, SEQ ID NO. 40, completes the BBR while at the 
same time completing the Pstl recognition site, leaving a four base overhang for hybridization with 
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additional BNAs, Accordingly, SEQ ID NO, 42 is added which has a four base pair sequence at the 
3 ' end vAndk is ccir5)lementary to the four-base overhang remaining from the hybridizadon of SEQ 
ID NOS. 40 and 41. In addition, SEQ ID NO. 42 is provided with a five base sequence at its 5 ' end 
vtech forms part ofafiamHI recognition site. The growing polymer ofBNAs is extended further 
5 by addition of the BNA SEQ ID NO. 43, which is complementaiy to SEQ ID NO. 42, completing 

the BBR while at the same time completing the BamEi recognition site and leaving a four base 
overhang which may be furflier hybridized with BNAs having con^lementary sequences. In this 
fa^on, the BNAsmaybel^ridized extensively so as to greatly an^lify the signal of a smgle PNA- 
TNA hybridization event 

10 As with the PNAs described in Example l,fiie BNAs may be used in an unlabeled form or 

may be labeled according to methods known in the art and described in Example 1 . It will also be 
appreciated that, rather than produce the BNA p olymer by sequential addition of BNAs to the PN A- 
TNA comqplex, the BNA polymer m^ be preformed and added directiy to the PNA-TNA complex. 
One siiqple method for preforming such a BNA polym^ includes the recombinant production of a 

15 vertor in which multimas of the BNA arc provided witii a unique restriction site at eitiier end of tiie 
pofymer. This potymer ofBNAs containing multiple BBRs is cut out of the vector and hybridizes 
to a single stranded 1/2 BBR remaining in the PNA upon hybridization of the PNA and tiie TNA. 
This is accomplished by providing a single stranded sequence in the PNA complementary to an 
overhang produced in the BNA polymer when it is excised from the production vector 

20 

Example 3 - Production of HNAs and Their Use for C apping BNA Polymers 

The HNAs of this mvention are produced according to methods known in the art for 
pofynucleotideiKoduction as described in Bcamplesl^ hi the production 

of the HNAs, however, the sequence of the HNA is specifically designed so that a substantial portion 

25 of the HNA forms a self-complementary palindrome to form a hairpin, while at the same time, 
kavii^ in single stranded form enough bases to be able to hybridize with single stranded sequences 
in the growing cham ofBNAs described in Example 2. 

In this Exan^le, a HNA of SEQ ID NO. 44 is provided to cap flie extension ofBNAs onto 
the PNA in Example 2 after the addition of the BNA, SEQ ID NO. 43. This is acccanplished 

30 because SEQ ID NO. 44, ^c having a palindromic sequence that farms a stable hairpin, also has 
a sequence at the 5' end of tiie HNA which completes tiie BamHl sequence formed by tiie 
l^^bridization of SEQ ID NO. 42 and SEQ ID NO. 43. Naturally, termination of the polymer after 
addition of only 3 BNAs is for the purpose of "simphcity in demonstrating tiie invention. As 
described above, tiiis polymerization m^ be continued essentially indefinitely to ampEfy the signal 
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of the PNA-TNA hybridization event Once the HNA hybridizes to the growing chain of BNAs, the 
polymer is capped and no fiirther extension of the polymer is possible. 

Example 4 - Preparation of TBAs and BBAs. Labeling, and hnmobilization Thereof 

5 The TBAs and BBAs vMcb may be nsed according to the instant invention include ar^^ 

substance which can specifically bind to the TBRs and BBRs formed by hybridization of the PNAs, 
TNAs and BNAs. Use of DNA binding proteins forms one example of such substances. 

For this CKsmpk, the TBA is the dimer of the DNA binding portion of p50, and the BBA 
is the lambda cro protean. These proteins may be produced according to methods known in the art 

10 The genes for both of these proteins have been cloned Thus, these protems are recombinantly 
produced and purified according to methods known in the art Furthennore, these proteins are 
labeled, either with a radioisotope, such as radioactive iodine, or with an enzyme, such as beta- 
galactosidase orhorser^ish peroxidase, or with a fluorescent dye such as fluorescein or riiodamine, 
according to methods well known in the art. In addition, either or both of tibe TBA and BBA may 

15 be immobilized on a solid surf ac e such as the surface of a microtiter plate or the surface of a bead, 
such as a colored bead of diameter anywhere firom 0.01 to 10 fim. The labels on the TBAs and 
BBAs may be the same or different 

In Ons exan^>le, the TBA containing the din^c p 5 0 DNA binding domain i s labeled with 
ifaodamine, while the BBA, cro, is labeled with fluoresceia Accordmgly, upon hybridization of the 

20 PNAs, TNAs, BNAs and HNAs as described in this patent disclosure and the foregoing and 
following examples, the nucleic acid hybrids, if formed, are contacted with excess labeled TBA and 
era. The fluorescence of these labels is measured according to known methods and, d^ection of 
both signals is indicative of the presence of 1/2 TBR sequences in the TNA. The diff^ential signal 
produced by the fluorescence of the NF-kB and cro is a measure of the degree to which the 

25 polymerization of BNAs onto the PNA-TBA hybrid has resulted in ampHfication of the si^. 
Amplification firom one to over a thousand fold is contemplated according to the method of this 
invention. 

Example 5 - Hybridization of two PNAs with a TNA and Discrimination BdAveen a TNA and a 
30 CNA 

Tlie PNAs, PNAl, SEQ ID NO. 40 andPNA2, SEQ ID NO. 45, are used in about ten-fold 
molar excess over the concentration of TNAs in a test sample. For this example, an isolated duplex 
HIV LTR, wherein one strand of which has the sequence SEQ ID NO. 37, shown in Figure 7, and 
the otha- strand of which is complementary to the sequence shown in Figure 7, is used as the TNA. 
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A dupkK isolated CNA is also used in this example, one strand of \Aich has the same sequence as 
SEQ DNO. 37, except that, in the first NF-kB binding site shown in Figure 7, at the center of the 
binding site, position 1 in Figure 7, mstead of a "T," there is an "A," the con?)lcmcntaiy strand of 
which therefore mismatches with the SEQ ID NO. 40 PNA at that locatioa 
5 SEQ K> NO. 40 and SEQ ID NO. 45 are both added to separate reactions, the first 

containing the above described TNA and the second containing the above described CNA. The 
saxnples are solubiliTed in an appropriate hybridization buffer, such as 10 mM Tris (pH 7.5), 1 mM 
EDTA. The sanqiles are heated to about 90*C for about five minutes to strand separate the duplex 
xNAs and CNAs in the sanqsles, and then the samples are allowed to cool to allow strands of PNAs, 
m 10 TNAs and CNAs to anneal. 

j; : Once the hybridization has gone to coit?)letic3n, ^siiich can be detamined according to known 

rfj methods such as by calculating the tI/2 based on base compositions and annealii^ temperature 

% according to known methods, the SEQ ID NO. 40 PNA is polymerized by addition of BNAs as in 

s Example 2 and the SEQ ID NO. 45 PNA2 probe is polymerized with BNAs starting with Sphl 

55 15 recognition site pveriiang. Following addition of the BNAs and a brief lr^bridizationperio4 the 

f 11 separate sanq)ies are added to beads coated with covalently immobilized NF-kB, and the NF-kB is 

2 allowed to bind to any TBRsfiMmed in the TNA ami CNA s^ After about 15 minutes of 

binding, the samples are washed twice with about three volumes of an ^propriate washing buffer, 
such as 10 mM Tris, pH 7.5, 1 00 mM NaCl, or another buffer i^e-determined not to int^ere with 
20 NF-kB, or bacterioph^e lambda CI repressor protein binding activity. After each wash, the beads 
are allowed to settle under gravity or by brief centrifugation. This removes any nucleic acids which 
do not have a perfea NF-kB binding site formed by hybridization of the PNA i and TNA sequences. 

Atothefinal wash, bactoiophage lambda CI repressor protein labeled with a radioactive 
isotope, such as with radioactive iodine, or labeled with an enzyme, such as horseradish peroxidase, 
25 with colored beads, or with a fluorescent label is added to each sample. The samples are then 
washed sevaal times (dboot 3) with several volumes (about 2) of an appropriate washing buffer such 
as 10mMTris,pH7.5, 100 inM NaCl, or another buffer pre-determined not to interfere with NF- 
kB, or bacteriophage lambda CI repressor protein binding activity. After each wash, the beads are 
allowed to settle under gravity or by brief centrifiigation. Following the last settling or 
30 centrifiigation, the bcnmd label is quantitated by detecting tiie bound radioactivity, liberated color in 
an enzymatic assay, color of bound beads, or fluorescence detection. Alternatively, an anti-Cl 
antibocfy can be added and a standard sandwich enzyme linked immunoasss^ or radioimmunoassay 
performed to detect bound repressor, hi addition, as a negative control (background), all of the 
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fc^egoing manipulations are carried out in tandem with a sample in which beads are used having no 
immobilized NF-kB. 

As a result of the foregoing assay, the control and CNA containing samples have similarly 
low signals while the TNA containing sample has a signal well above background 

5 

Rvample 6 - A Test Kit for the Detection of HIV 
A. Kit contents: 

1. MiCTOtiter plate. 

2. 1 mg/mL solution of recombinantly produced NF-kB in tris-bufF^ed 
10 saline. 

3. Tube containing single stranded HIV PNAs (a mixture of pre-mixed 
oligonucleotides encoding two NF-kB 1/2 binding sites, i.e. a mixture of 
SEQ.ID.NosJandS). 

4. Tube containing single stranded human genomic PNA, SEQ ID NO, 1. 
15 5. Tube of nuclease (Pstl). 

6. Tube of protease. 

7. Tube containing pre-polymerized BNA's, 100 repeat units of 
bacteriophage lambda capped with an HNA but with fi^ 1/2 BBRs 
available for binding to PNA-TNA hybrids. 

20 8. Tube ofhorseradish peroxidase Om?>) conjugated era 

9. Tube of hrp colored substrate. 

10. Tris buffered sahne, 100 mL. 

11. Lancet. 

12. Reaction tubes A, B, C, each containing 250 of distilled water. 
25 13. Medicine dropper, 

B. Assay method: 

(a) The nricrotitcr plate (item l)iscoat8d with the solution of recombinantly produced 
NF-kB (item 2) at a concentration of I mg/mL in tris buffered saline overnight at 

30 4''C with rocking. 

(b) Three drops of blood of the test taker is obtamed 1^ pricking a finger with the 
lancet (reagmt 1 1), and a drop of blood is dispensed into each of reaction tubes A, 
B, and C (reagent 12). 
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(c) Into each tube is dispensed one drop of protease solution (reagent 6) with the 
medicine dropper (item 12) and the tube agitated and allowed to sit for 5 minutes. 

(d) One drop of nuclease (item 5) is added to each of tubes A-C using the medicine 
dropper and the tubes agitated and allowed to sit for 1 0 minutes. 

5 (e) One drop of item 3 is added to tube A (test sac:?)le); one drop ofitem 4 is added 

to tube B (positive conttol); and one drop of saline (item 12) is added to tube C as 
an^ve coiitroL The tubes are heated to SOT in hot water and allowed to cool 
to room temperature over one hour. 

(f) While tihe I^ridizadon is allowed to occur in step (d) , the excess protein is drained 
1 0 fiom flie saddo^ and die micr<^ter plate, fix>m step (a) , and the plate is rinsed with 

tris buffered saline (tube 10). 

(g) The contents of tubes A-C fiom step (e) are transferred to three wells of the 
microtiter plate and allowed to stand for I hour with rocking. 

(h) The microtiter wells containing the contents of tubes A-C are rinsed with tris 
15 buffered saline and emptied. 

(i) OnedropofitemTis added to each well and allowed to hybridize with aity 1/2 
BBR sites bound to the plate, over <Hie hour, followed by three rinses with tris 
buff^ed saline. 

(j) One drop of item 8 is added to each well and cro is allowed to bmd to any bound 
20 BNA*s over 10 minutes, followed five, one mL washes with tris-buffa:ed saline. 

(k) Oi^ drop of hrp substrate is added to each well and color allowed to develop. 



C. Results: 

If wells A and B both show color development, and well C does not, the test is valid and the 
25 subject has been infected with HTV. If only well A shows color developmKit, or if well C shows 
color development, the test has been performed incorrectly, and is invalid If wells A and C show 
no color ^vdopmait but well B does, the test is valid and the individual has not been infected with 
HIV. 



30 Example 7 - Producticm of Various Novel IB As 

Novel TBAs for use according to the instant invention are prepared as follows: 

NFkB/NF*kB fHIV-Dctect R . A nucleic acid encoding any one of SEQ ID NOS. 63-71 
or a like NF-kB DNA bindmg protein, is fused, in frame, to a nucleotide sequence encoding an 
assembly seqim», such as cro, such that the NF-kB DNA reco©iition sequence is encoded at amino 
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or carboxy teminus of the cro sequence. Optionally, a linker sequence is provided betw'een the NF- 
kB sequence and the cro sequence. At the other terminus of cro, a nuclear localization signal 
sequence, such as SEQ ID NO. 72, is optionally provided. Further, asymmetry sequences are 
optionally provided at the cro terminus unused by the NF-kB recognition sequence. Examples of 
complete TBAs are shown below. 

Vd) NF-kB/SPl YHIV-Detect ID , In a similar fashion to that described in (a) above, a 
recombinant coding sequence encoding an NF-kB reception domain is prepared. In a separate 
construct, mstead of SEQ ID NOS. 63-72, the coding sequence for the DNA recognition portion of 
SPl is included. Such a sequence should ^code all or a functional part of SEQ ID NO. 73, which 
is&atportionoftheSPl transcription factor exhibiting DNA binding {see Kadcmaga er at [1987] 
Cell 51:1079-1090). The NF-kB-encoding vector and the SPl-encoding vector are then co- 
transfected into an appropriate expression system such as is well known in the art A monomeric 
NF-kB recogniticHi unit is added to complete the NF-kB recognition dimer after the assembty of the 
SPl and NF-kB lecognidon units by the ch^OTOoe. The asymmetry sequaioes prevent the formation 
of NF-kB or SPl dimers and direct, instead, the formation of NFkB-SPI heterodimers (/.e., HIV- 
Detect n), which are then isolated from the expression system (m a mm a li an or bacterial cells) by 
known methods. 

(c\ SPl/SPI TBAs rmV-Detcct im . As described in (b) above, an SPl-encoding TEA 
construct is prepared. However, only this construct is transfected into the ^ression system, and 
asynunetry sequences allowing the formation of SP 1-SP 1 dimers are included. 

(£s SPl -TATA fHIV-Detect IV) . As described in (b) above, an SPl-encoding TBA 
recombinant is produced. In addition, a recombinant encoding a TBA having the binding sequence, 
SEQ ID NO. 74, or like sequence encoding a TATA recognition unit is prepared with as>Ti2metry 
sequences con^Iementary to those included in the SPl TBA-encoding construct These constructs 
are co-transfected and the heterodimers isolated by standard methods, including afSnity purification 
on a DNA column having the appropriate SPl-TATA target binding regions. 

(e) SP1'E2 fHPV-Detect D . An SPl-encoding construct is prepared as in (b) above. An 
El TBA-encoding construct is prepared by using a sequence encoding aiQ^ one of S EQ ID N OS . 7 5 - 
84 and 94-98 which are papillomavirus E2 DNA recognition units (see Hegde et al [1992] Nature 
359:505-512) or like recognition units, is prepared and co-transformed or co-transfected with the 
SPl TBA-encoding construct. Monomeric E2 recognition unit is added to the complete E2 
recognition dimer after the assembly of the E2-SP1 recognition unit by the chaperone. The 
heterodimer HPV-Detect I is isolated according to known methods. 
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(fi E2-E2 (HPV'Detect III As described above in (e), an E2 TBA-encoding construct is 
prepared, except that asynmietry sequences are included which permit the formation of E2 dimers. 
The ejspressed dimas are then isola^ by known meflKxls including affinity for a dimeric E2 binding 
site on a DNA afBnity column. 
5 fg^ E2-TATA rHPV-Detect m As described above in (e) and (d),E2 and TATA binding 

TBAs m prepared (re^jcctivcly), except that asymmetry sequences are included which enhance the 
fomiatiOTofh^odimsrs rather than homodimers. These ccmstructs are then co-e?qMressed and the 

heterodinKTs are isolated 

(h) TATA-TATA fflPV-Detect N) , As described above in (a) and (d), a TATA binding 
10 TBA-encoding construct is prepared using asymmetry sequences that encourage this homodimer 

formation and the homodimer is isolated; 

(i) Other TBAs > As described above for HIV and HPV TBAs, TBAs for any given 
pathogen or disease state may be produced by identifying specific DNA binding proteins and 
forming an expression construct using appropriate linker, assembly, and asynmietry sequences. 

15 

Example 8 

la a similar &shion to the ass^ described in Example 5, a more stringent assay is produced 
by usingthedip^NF-kB-SPibindingprotein prepared accordm^ Accordingly, the 

probes shown in Figure 7 and used in Exan^le 5 may be lenglhm^ to reduce the interprobe distance 
20 and thereby reduce the flexibility of the DNA in the TNA. 

Exam ple 9 - Production of "High-Qrder" TBAs 

By the appropriate use of asymmetry sequKJces, TBAs are produced which are dimers, 
trimers, tetrameres, pentan^s, or hexamers of particular DNA recognition units, hi this fashion, 

25 a hexameric TBA is produced by making a first NF-kB p50 dimeric TBA using asymmetry 

sequences which enable dimer fwrn^oa In addition, the asymmetry sequences enable the 
tetramerizationofthepSOdimerwithanSPl-SPl dimer. Finally, additional asymmetry sequences 
direct the hexamerization with a dimer exhibiting nuclear localization sequences. This is 
accomplished by incorporating, for example, asymmetry sequences from insulin, which in nature 

30 ftmis hexamers. This hexamer formation is directed by the sequences, SEQ ID NOS. 85 (A) and 
86 (B), 87 (A) and 88 (B), 89 (A) and 90 (B), and 91 (A) and 92 (B) (see Figures 13 and 14). 

Because of the extrmiely high affinity for the HIV-LTR that can be generated using a 
multimOTC TBA, the compounds having this structure and which can be used for this purpose are 
referred to herein as "HIV-Lock." 
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An optimal HIV-Lock is defined by footprinting (according to methods well knovm in the 
art) TBAs bound to TBRs in the HIV LTRto confimi that the binding aflBnity of each DNA binding 
protein contributing to the formation of the multimeric TBA complex is downshifted relative to the 
aflBnity for any natural target sequence (Le. CNAs) from which the DNA binding reception unit 

5 of the TBA is derived. Any concomitant loss m binding affinity for the HIV TBRs is more than 

compensated for upon formation of the muitimer as described below. 

There may be con5)etition between the binding of e^h component TBA for its TBR and 
assembly, via asymnaetrysequQices to form the muitimer. This is obviated by adjusting the linkers 
bdwem the cfa^)aone and asymmctiy sequences in each TBA conqx^n^ such that these compe^g 

10 events are uncoiq)led. The resultant reduction in the dimensionality of dififiision (effective 
concentration increase) for the TBA asymmetry and assembly components results in effident 
formaticHi of the multimeric complex. 

On flie basis of the foo^rinting. the length and composition of lintes is adjusted to achieve 
optimal discrimination between target HIV sequences and natural sequences. In this fashion, 

15 ahkHi^ each conqxment TBA will have a low afiSnity for CN A and TBR sequaices, the multimeric 
complex win have an extremely high affinity for the now e?q3anded TBR recognized by the 
multimeric ccxnplex (the square of the affinity of each TBR recognized by each component TBA of 
the multimeric TBA), while still havmg a low affinity for CNAs. In the same fashion, other 
multimeric TBA complexes, aside from HIV-Lock, are prepared 

20 TBAs which can be formed in this fashion include the following sequences, which are 

assembled by linking either the protein subunits or nucleic acid sequences encoding these subunits, 
as follows: 

Set Link Sequences from Groups 

A i+n+in 
25 B iv + v + m 

c iv+in 

vdierein groups I-V consist of sequences selected from: 
Group Selected from Sequences 
I AnyofSEQ ID NOS. 85-92 

30 n MetSer,UnkedtoanyofSEQIDNOS l(M-106,e^hofwhichislinked 

toSEQIDNO. 99. 

m SEQ ID NO. 100 linked to any of SEQ ID NOS, 75-84 or 94-98; SEQ ID 

NO. 101 linked to either SEQ ID NO. 74 or SEQ ID NO, 93; or SEQ ID 
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NO. 102MedtoSEQIDNO. 74orSEQn)NO. 93; or any of SEQ ID 
NO. 72, 103. 73. or 63-71. 

IV AnyofSEQIDNOS. 104-106. 

V SEQ ID NO. 99. 

5 Specific examples of such TBAs are SEQ ID NOS. 109-1 16, assanbled as follows: 



Sgt 


SEOroNO. 


lank SEC IDS 


A 


109 


85 + Met Ser + 104 + 99 + 100 + 94 


A 


110 


85+MetSer+104 + 99 + 72 


A 


111 


86 + Met Scr + 105 + 99 + 102 + 74 


A 


112 


86 + Met Ser +106 + 99 + 73 


A 


113 


89 + Met Ser +106 + 99 + 63 


C 


114 


106 + 64 


C 


115 


105 + 64 


B 


116 


106 + 99 + 73 



15 

In this fashion^ choosmg between appropriate asymmetry sequences, assCTibly sequences, 
and DNA lecc^pMon units, many difiarent TBAs may be formed. Furlbennore, sets of these, such 
as SEQ ID NOS. 114 and 115, \vmassodate with each other but dimers of SEQ© NO. I14or 115 
will myt form due to charge rq>ulsian in the mutated assembly sequences (SEQ ID NO. 104 is cro; 
20 SEQ ID NO. 105 is a novel mutated, negatively charged cro, and SEQ ID NO, 106 is a novel 
mutated, positively charged 

Naturally, givea die amino acid sequence of these TBAs, one of ordinary skill could produce 
recombinant nucleic acid clones ^icoding these, and such recombinant clones naturally fonn an 
integral part of this inventicML 

25 

Example 10 - HIV Test Using "HTV-LOCK" 

in much &c same method as used in Example 6, the "HIV-LOCK'^ produced according to 
Example 9 is used as the TBA, reagent 2, with similar results. 



30 



Example 1 1 - HIV Test Using "fflV-LOCg^ When Testing Blood for Donation 

When the quantity of blood to be tested is not limiting, as vrfien samples of blood for 
dcoation are to be tested for HIV contamination, tests similar to Example 6 are run, but for each of 
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tubes A-C, about 5 mL of blood is pelleted in a tabletop centrifuge. Other reagents are scaled up as 
necessary to handle the larger quantity of TNA present in the sample. 



Exam ple 12 - 'TflV-LQCK" as an Anti-HIV Therapeutic Agent 
5 "HIV-LOCK" produced according to Example 9 is fOTtnulated as a 1 mgtoL solution in 

}q)oscmes and injected intravenously into a subj ect vibo has been tested and confirmed to b e infected 
with HIV. A d<^ of about Oi mg to 100 mg of "HIV-LOCK"/kilogram bocfy mass is infiised over 
atwenty-four hour period and the concentration of HIV p24 in the patient's serum monitored The 
treatment is repeated as often as necessary, such as when elevations in the serum p24 occur, 

T?if ample n - Use of an fflV-TBA Construct as a Therapeutic 

Areccmbinant retroviral or like vector is xised to deliver a construct encoding an HIV-LTR 
binding TBA to an infected patient The vector encodes a chaperone, such as cm, and sequences 
DNA for binding portions of p50. The san» vect<»- also encodes a chaperane on which an SP 1 TBA 

15 folds. Asymmetry sequences are provided such that upon co-e3q)ression of the p50-TBA and the 
SPl-TBA in a single HIV mfected cell in vivo, an immediate association occurs between these 
TBAs, \sliile at the same time preventing any association between the DNA binding portion of p50 
and endogenous p50 (M:p65 monomers. NLS sequaices are also provided in the TBAs so that, upon 
dnner fonnatioo, the TBA immediately relocates to the nucleus of the cell and binds specifically to 

20 integrated HIV sequences, thus preventing any transcription from that locus. 

For this purpose, it is desirable to select sequences Kicoding DNA binding domains such 
that the expressed monomers are assembled into a TBA which does not bind to natural human 
seqiwnces. Thus* it is only npon binding of the TBA components to their target sequences that 
association between all ccanponents of the TBA occurs to form a complex which tightiy and 

25 specifically binds the HIV LTR. 



Kyamptft 1 4 - Diagnostic Test Kit for Human Papillomavirus 

This diagnostic for human p^illomavirus takes advantage of tiie known differential between 
benign and carcmogenic HPV to provide a test which indicates the susoeptibiUty to maUgnancy in 
30 a patient The p^illomaviruses are a group of small DNA viruses associated with benign squamous 
epithelial cell tumors in higher vertebrates. At least 27 distinct human types of papillomaviruses 
(HPVs) have been found; many of these have been associated with specific clinical lesions. Four of 
these, HPV-6, HPV-1 1, HPV-16, HPV-18, and HPV-33 have been associated with human genital 
tract lesicms. In general, HPV-6 and HPV-11 DNAs have been found associated with benign lesions 
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of the genital tract. HPV-16, HPV-18, and HPV-33 have also been found associated with 
premalignant and malignant lesions and are transcribed in most cell lines established from cervical 
carcinomas. HPV-16,HPV-18, and HPV-33 arelikely to be only two members of a large set of 
HPV DNAs associated with malignant human cervical carcinomas, 

5 Animal models have shown that benign papillomavirus lesions can prog?:ess to malignant 

lesions in the presence of a co-carcinogen. HPV DN A has been found in metastases of cervical 
carcinomas, himalignant covical lesions, HPV DNA is usually integrated into the human gencnne, 
but there m^ also be extrachromosomal HPV DNA present Integration of HPV to form the 
provinis usually results in the disruption of the viral E2 open reading frame (ORF). Despite 

10 disruption of tl» E2 ORF, and examination of cell lines from several cervical carcinomas has shown 
transcriptionaUy active and integrated HPV-16 and HPV-18. When HPV-16 genomes which are 
present in the human cervical carcinoma cell lines SiHa and CaSki have been examined, there are 
differences found in the integration of HPV-16. In the SiHa hne, the single HPV-16 genome 
integration occurred at bases 3132 and 3384, disrupting the El and E2 ORFs with a deletion of 03 

15 kb. An additional 50-basepair deletion of HPV-16 DNA resulted in the E2 and E4 OFRs being 
fused. The 5' portion of the HPV-16 DNA, consisting of the disrupted E2 ORF, is ligated to 
continuous human right flanking sequences. In addition, a single additional guanme is detected at 
nuciecrtide 1138 inthemiddieoftheEl ORF. This basepair addition results in the fusion of the Ela 
and Elb ORFs to a single El ORF. 

20 Hie con]?)lete genome of HPV-16 is available on GenBank as accession number K02718; 

the complete genome of HPV-33 is available on GenBank as accession number M12732; the 
complete genome of HPV-18 is available on GenBank as accession number X05015. 

As a preliminary screen, the fact of an HPV infection is established for a given cervical 
biopsy smxplc hy a smipte "yes/no" type of analysis using, for example, any or aU of the PNAs SEQ 

25 ID NOS, 46-53 and an E2 TBA as described above {te., fragment DNA, binding the PNA, 

immobilize with the TBA, and detect signal with BNAs and BBAs). 

Once a biopsy sample is found to be positive for HPV, additional information is obtained 
as to the malignancy potential of the HPV by analyzitig the integration status of the vims in the 
hxmian genome. 

30 L Fragment the DNA in the cervical biopsy sample and hybridize to a blocking probe 

having the sequence, SEQ ID NO. 60. This probe will bind to all tiie fragments in 
the DNA which have not sphced out the 0.3 kb fragment. 
2, Expose the DNA in the biopsy sample to a PNA having die sequence, SEQ ID NO. 
61. This probe will only bind to fragm^ts which have deleted the 0.3 kb fragment 
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(the blocking probe will prevent the looping out of the large deletion segments if 
present). 

3. A PNA having SEQ ID NO. 62 is hybridized with SEQ ID NO. 41 to fonn a BBR 
which will bind to cro or X CI repressor as a BBA, leaving a single-stranded 

5 pardoncapableofhybridiangwiththeTATAsiteottSEQroNO.^ Thisadded 

to form a TBR on the 5' end of the large deleticm. 

4. The TBR is immobilized by a TBA having a TATA binding protein DNA 
recognition unit. 

5 . The bound fr^ments are d^ected by adding BNAs and BBAs as described above. 
10 Detection of signal in this assay indica^ that the large fragmrait is deleted in HPV present 

in the TNA. Since this deletion is correlated with malignancy, thk assay provides insight into the 
malignancy potential of the HPV infectioa This conclusion can be confirmed by performing an 
analogous assay based on the deletion of the 52-basQ)air fragn^nt which is also correlated with 
HPV-induced malignancy, 

15 The TBP reco^iiti(»i unit used in the TBA for this as say may be chosen, for example, from 

a sequence such as SEQ ID NO. 70 or SEQ ID NO. 93. 

Exam ple 15 - Reco mbinant HIV-^LQCK™ ProducticHt. 

PhaseOne-PreparationofDNAtoProducetheHIV-Lock™. ftv/rro mutagenesis of 

20 the coding regions of the naturally occurring, cloned ccHiq)onents of the HIV-Lock^ which need to 
be modified is performed with a MutaGene Phagemidkit The modified protocol includes the use 
of a Blue-script plasmid containing each of the binding componmts of HIV-Lock^. These are 
transformed into competent cells and uradl-containing phagemids are grown. Single stranded DNA 
is extracted and used as a ten^jlate for Aerautagaiicstra^ Oligonucleotides ccmtaining the desired 

25 mutations, including the incorporation of a novel restriction site, are synthesized and treated with 
polynucleotide kinase and ATR The kinase treated oUgcmucleotides are annealed to the single- 
stranded template, and a mutagenic strand is synthesized and Hgated according to the MutaGene 
protocol, with the exception that Sequenase 2.0 provides the polymerase. Libraries are screened 
using both g-^^ end-labeled nucleotides containing sequences complementary to the introduced 

30 mutations and by isolating the plasmid DNA and identifying the mutants by the presence of the 

introduced restriction site. The mutations arc also confirmed by sequencing with a Sequenase kit. 
The HIV-Lock™ DNA is ckjned into the bacutovirus expression system with a polyhedron promotor. 

Phase Two • Production of HIV-Lock™ Proteins Using Baculovirus. Sf-9 cells are 
cultured to a pre-ctetermined density (sboai IxlO^ cells/ml, log phase), infected with the baculovirus 
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containing the HP/-Lock™ instructions and harvested to recover the recombinant proteins 
comprising the HIV-Lock^. hi the scale-up process, cultures are expanded &om flasks to spinners 
and subsequCTtJy to bioreactors. Following infection the cells are harvested at 12, 24, 36 and 48 
hours for flie protein, indices ofviability are monitored throughout the entire process. 

5 PfaaseThree- Purification of flieHIV-Lock™Protebs The harvested proteins are first 

sspsrstsd fiom particulates by ftow-fhrou^ ultracentri&gatiott to facilitate downstream purification. 
The centrifuged product is then sterile filtered Extracts are then centrifuged at 40,000 rpm at 4"" 
C for 30 minutes and aUquots are immunoprccipitated with polyclonal rabbit antiboc^ against one 
of the HIV-Lodc'r** components, Immunoprccipitated protems are run on an SDS-10% PAGE gel. 

1 0 Hiase Four - Test of HIV-Lock^^rotems Against HTV DNA Mobihty shift assays are 

carried out using an oUgonudeotide probe comprising elements of the HIV long terminal repeat and 
fragments containing NFKB binding DNA associated with kappa hght chain and microglobulin 
regulation. The oligonucleotide is annealed to its complimentary strand and ©3d-labeled with g-^ 
ATP. 

15 Foo^rinting is acconiplished by combining small (10"^^ M) of radiolabeled HIV LTR DNA 

with a slightly larger amount of HIV-Lock™ in a buffer at room teaperature for 10 minutes. 
Dithiothreitol is added prior to the addition of protein. Iron (II), EDTA, l^drogen peroxide and 
sodium ascorbate are added and the reaction mixture is incubated. A quenching agent is added and 
the products are analyzed suing denaturing gel electrophOTcsis. This is done for different 

20 concentrations of protein. The resulting gel is imaged usmg a phosphoimager scanner and the 
resulting high resolution image file is anatyzed to abstract the binding affinity of HIV-Lock™for the 
HIV DNA relative to cellular DNA. 

Multiple design and testing iterations may be used in order to refind binding of HIV-Lock'^** 
and other TBAs for HIV and other organisms. This process makes it possible to design binding 

25 assOTblies such that the bmding assembly is not competitive with the wild type proteins for single 
binding sites in the geiKsme samples. The development of TBAs ibr other organisms and TN As for 
sequences within these organisms can be made usmg the aforem^tioned method. This method is 
valid when producing binding assemblies for all nucleic acid TBRs including DNA-DNA, DNA- 
KNA and KNTA-RNA hybrids and combinations of these hybrids. 

30 

Example 16 » Method for identifiing nucleic acid binding molecules for production of TBAs and 
BBAs of the invention: 

In the method of this invaition, target binding assemblies and booster binding assembhes 
are assembled by idaitifying nucleic acid binding molecules, and linking the nucleic acid binding 
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portions of the molecules in such a fashion as to achieve TBAs which discriminate between 
particular target sequences and even closely related sequences. One method for identifying the 
nucleic acid binding molecules involves the following steps: 

L Obtaining a biological samph containing the target nucleic acid. This could be, for 
example, an organism or a tissue extract infected with a pathogen. 

2. Fragmoitingthe san^le so as to expose the nucleic acids and to reduce the size complexity 
of the nucleic acids contained in tte sample. 

3. Contacting a first aliquot of Ihe fi-agmented nucleic acids with a control bufife* medium and 
contacting a second aliquot of the fragmented nucleic acids with the control buffer medium 
containing a known profile of nucleic acid binding molecules. 

4. Anafyzmg the two aHquots to identify firagments whidi have altered behavior in the aliquot 
contacted with the target binding molecules as opposed to the control aliquot. This is 
accoiiq)Iished by single dimension gel electrophoresis, two dimension gel electrophOTesis, 
high perfomiance liquid chromatpgraphy , paper chromatogr^hy or any other means which 
reveals a different bdiavior of the nucleic acid firagments when bound to a nucleic add 
binding molecule as opposed to when the nucleic acid fragment is unbound 

5. Identifying sik! isolating firagments which do exhibit altered behavior when contacted with 
the nucleic add binding molecule and cither sequoicing the nucleic add firagment to 
detennine whether known nucldc acid binding molecule motifs are preseait, or directly 
identifying the nucldc acid binding molecule bound to the nucleic acid The latter can be 
achieved, fcs: example, by wHitacting a two dim^ional grid of the electrophoresed nudeic 
adds with differentially labded antibodies whidi bind to the various nucleic acid binding 
molecules. 

In this method, preferably nucleic acid motifs arc used for dthcr diagnostic or therapeutic 
purposes wherein the target nudeic acid has more than a single utilizable nucleic acid binding 
molecule target In this w^, a complex target binding assembly can be gcn^ted which takes 
advantage of the proximity of differwU nucleic add binding molecular motifs to enhance the 
spccifidty of the TBA assembled &cm the individual nucleic acid bmding components identified 
The various nucldc acid binding portions of the nucldc acid binding molecules are then assembled 
into the complete TBAs as described above, for example, for HIV-LOCK'^'^. 

Example 17 - Method of identifidng spe cific RNA sequences in a sample 

According to the methods and compositions taught in this invention, any nucleic acid 
scqiKnce can be specifically identified. Identification of target HIV RNA in a sample is adiicved 
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by obtaining a saitg)le of a patient*s blood or other biological fluid or extract which may contain the 
HIV RNA, and testing for the presence of TAR binding sites. Tat is a positive regulator of HIV 
leplicadoii^shidibindstotbeTARregionoftheHIVRNA. The smaUest naturally occurring. My 
active form of HIV-Tat is 72 amino acids in ieigth, SEQ Id 1 18 herein. Tat contains at least two 

5 fimctional dc»nains, and transactivates g^e expression jfrom the HTV long terminal repeat (HIV 

LTR). Tat binds to an RNA stem loop structure fonnedfrcm the self-hybridization of sequ^ 
inTAR,whichisjust5*totheHIVLTR HIV TAR RNA forms a dinucleotide bulge and two stem- 
loop Structures (Rhnn et al. 1994 Viroloa^:202, 202-211). The Tat {SEQ. Id. 118) binds to this 
structure with lower avidity than does Tat variants wherein AIa58 is a threonine or where His65 is 

10 an Asp residue. (Derse et al., 1993 Virology: 194,530-536). Utilizing these facts in the instant 

method is accompli^bed by: 

1 . Fragmenting a biological sample to expcse the nucleic acids and reduce the size complexity 
of the xmcleic adds. 

2 Contacting a TBA with the sample vAMi identifies a hybrid TAR bmding protein sequence 
15 and a pimimate flanking sequence in the HIV genome. The TBA used for this purpose is 

assembled on cro as the chaperone using Tat as the HIV RNA specific binding molecule. 
To provide specificity such that cross-talk between the HIV TAR site and closely related 
TAR sites whidi be ptcssst due to such other pathograis as cytomegalovirus, the TBA 
also has an antibody component which recognizes the DNA-RMA hybrid target b i n d i n g 
20 regicm fonned when a probe nucleic acid binds to the HIV LTR RNA. 

3. Eliminating any "cross-talk" produced by binding of Tat to the TAR region of the HIV 
RNA due to such contaminants (cousin RNAs) as the CMV TAR sequent by contacting 
the reaction with excess Tat variant (either the Ala58 to Thr or the His65 to Asp variants) 
which bind more avidly. In this way, single binding events due to the TBA binding to a 
25 cousin RNAs are competed fi-ora the nucleic acid sample by the Tat variant On the other 

hand, by appropriate^ selecting fte a£5nity of the double binding achieved as a result of the 
ardbocfy and Tat, the TBA ^ not displaced from true targets. This process is illustrated in 
figure 16. In another aspect of this same method, the TBA could be one in which, rather 
than using a variant of Tat, an antibody is used which recognizes this nucleic acid segmait, 
30 and the TBA used is a double antibody TBA. 

In an alt^iate version of this method, a probe nucleic acid may be used which hybridizes 
with the HIV LTR RNA. Accordingfy,adi^lexse^3nentoftheLTRspl sitescanbeo-eatedaspart 
oftiie target binding region. This region ofthe HIV RNA flanks the TAR region which is 5* to the 
LTR but is in close proximity thereto. A TBA containing Tat and two Spl binding units is 
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chapQ'CHiedtoprovi<fe Tat binding to TAR and Spl binding to the Spl binding sites. Amplification 
and detection is then carried out by adding appropriate BNAs, BBAs and HNAs. In yet another 
aitanatrve, PNAs having Seq. ID. 38 and Seq. ID. 39 (see figure 7) could be used. A TEA is used 
vrfridi contains one or more Sp 1 binding units and an antibo^ unit which binds to the DNA-RNA 
5 lybrid produced fit)msan5)leRNA and the Seq Jd 38 PNA. Appropriate BNAs, BBAs and HNAs 
are then added to amplify the signal 

Naturally, those skilled in the art will recognize that other TBA aiMi TNA combinations 
could be used to optimize the njelhods exemplified herein. 

10 It should be understood that the examples and embodiments described herein are for 

illustrative purpose only and that various modifications oc changes in light thereof will be suggested 
to persons skilled in the art and are to be included within the spirit and purview of this application 
and the scope of the appended ciainas. It will be understood that sequences provided herein are 
exemplaiycxily and that c^faer like sequences su^ested by th^^ could be used in be methods of this 

15 invention- It wiU also be imderstoodft^althou^ any sequence provided hsrem 

as linear, it could be used in a circularly or otherwise permuted form and although desi^ated as not 
being anti-s«ise, it could be used in the coding or non-coding form or to bind to coding or non- 
coding con^>lementary sequence. 
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SEQUENCE LISTING 



(1) 



XNFORK&TZONS 



(i) 



jICANT: 



Applicant Kaioe(s): 
stzteet address : 
city : 
State/Province : 
country : 
Postdl code/Zip: 
Pijpne numiber: 



THE GENE POOL, INC* 

300 Queen Anne Ave, Suite 392 

Seattle 

Washington 

us 

98109-4599 

(206) 526-8617 Fax number: 



(ii) TITIii OF INVENTION: METHOD OF DETECTION OF NUCLEIC ACIDS WITH A 
3CIFIC SEQUENCE COMPOSITION 

(iii) NUMBEk OF SEQUENCES: 118 



(iv) CORREj^ONDKNCE ADDI^SS: 

(A) ADDRBSSES: Saliwanchik & Saliwanchik 

(B) STREET: 2421 N.W* 4i3t St., Suite A-1 

(C) c:^TY$ Gainesville 

(D) STATE t Florida 

(E) COyNTRV: USA 

(F) ZI^: 32606 



(V) COHPUTEil READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPE^TISG SYSTEM: PC-DOS/KS-DOS 
{D) SOFa|»ARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPL^f CATION NUMBER: 

(B) FILING DATE: 
<C) CLASS'^FICATIOK: 

{Viii> ATTORKSY/A(IeNT INFORMATION: 

(A) NAME: Wencen, Gerard E 

(B) REGISTRATION NtJMBER: 35,746 

(C> REFERENCE/DOCKET NUMBER: GP-lOO.Cl 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (904) 375-8100 

(B) TELEFAX: (904) 372-5800 



(2) INFORMATIOK FOR SEQ ID N0:1: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1^ base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: \linear 
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MOLECULE TYPEi CDNA 
(iii) HyPOTHETICAL: NO 



'I-SENSE; NO 
<xi) S^QtFENCE DESCRIPTION: SEQ ID NO:l: 
TCGGGATTCC ICCA 



{2> INFORMATION FOR SEQ ID NO: 2: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: both 
(b) TOPOLOGY: linear 



(iii) HYP 



(ii) MO^iECULE TYPE: CDRA 

»OTHETICAL: NO 
(iv) ANTI -SENSE: NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

AAGGGACTTT CCC 

1 
I 

(2) INFORMATION FOR SEQ ID N0:3: 

i 

(i) SEQyENCS CHARACTERISTICS: 
(A^ LENGTH: 13 base pairs 
(B) TYPE: nucleic acid 

{€} STRANDEDNESS: both 
{V) TOPOLOGY: linear 

(ii) M0L2;CULE TYPE: CDNA 

(iii) HYP6theTICAL: NO 

(iv) ANTIfSENSE: NO 

(Xi) SEQuilNCE DESCRIPTION: SEQ ID NO: 3 J 
AGGGGACTTT CCG 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) £*BNGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) *I^OPOLOGY: linear 
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tii) MOLECUIiE TYPE: CDNA 

(iki) hypothetical: no 

<iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GCTGGGGACT TTCCA 



(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS! 
\ (A) liENGTH: 15 base pairs 
\ (B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

\ (D) TOPOLOGY: lineair 

t 

1 

(ii) MOLECULE TYPE: cDNA 

\ 

(iii) jSYPOTHETICAL: NO 

(iv) i^KTI -SENSE: NO 

i 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ACAAGGGACTi TTCCG 



(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
(l^) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETIC!^: NO 

<iv) ANTlisENSE: NO 

<xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 6: 
CCGGGTTTTC CCC ■ 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: both 

(D) TOPOLOGY: linear 

I 
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{hi) KOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL S KO 
(iv\ AKTI-SENSE: NO 

(Xi)i SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
AAGGGACT|?T CCGCTGGGGA CTTTCCA 

<2) INFO^TION FOR SEQ ID NO:8: 

(i) IsEQUENCE CHARACTERISTICS: 

1 (A) LENGTH: 27 base pairs 

1 (B) TYPE: nucleic acid 

i (C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) ^POTHETICAL: NO 

{iv> ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 : 
AAGGGACTTT CCGCTGGGGA CTTTCCG 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
{iv> ANTI^SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GCTGGGGACT TTCCAGGGAG GCGTGG 

<2) INFORMATION FOR SEQ ID NO.:i£>: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) oIypE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 
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(ii) \MOI*ECOLE TTFE: CDNA 
(iii) IhYFOTHETIC&L: NO 
(iv) klTTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SSQ ID NO: 10: 
GCTGGGGAc4 TTCCAGGGGA GGTGTG 



(2) IKFORJ^TION FOR SEQ ID NO slit 

1 

<i) SEgUENCE CHARACTERISTICS: 

\A) LENGTH: 26 base pairs 
|b) TYPEt nucleic acid 

STRANDEDNESS: both ' 
(to) TOPOLOGY: linear 

V 

(ii) MOLECULE TYPE: cDNA 

i 

(iii) HYPOTHETICAL: NO 

I 

{iv> ANTj-SENSE: NO 

\ 

i 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 11: 
GCTGGGGACT TTCCGGGGAG CGTGGC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 
(iii) HYPOTriETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GCTGGGGACT TTCCGGGGAG GCGC6G 



(2) INFORMATION FOR SEQ ID NO: 13: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
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(D) TOPOLOGY: linear 
^(ii) MOLECULE TYPE: cDNA 
(lii) HYPOTHETICAL: NO 
(iv) ANTX-SENSBs NO 

SEQUEUCE DESCRIPTION: SEQ ID NO: 13: 



(4> 



GCTGGO(^CT TTCCAGAGAG GCGTGG 26 

\ 

\ 
s 

(2) INFC^OIATION FOR SEQ ID NO: 14: 

\ 

(i) I SEQUENCE CHAR&CTEHISTXCS : 
I (A) LENGTH: 26 base pairs 
\ <B) TYPE: nucleic acid 

; (C) STKANDEDNSSS: both 

\(D) TOPOLOGY: linear 

(ii) WLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

I 

(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 14: 

\ 
i 

GCTGGGGACT !^TCCAGGGGA GGCGTG 26 

\ 
\ 

\ 

(2) INFORMATION FOR SEQ ID NO: 15: 

\ 

(i) SEQl^BNCE CHARACTERISTICS: 

(A) l LENGTH: 26 base pstirs 

(B) I TYPE: nucleic acid 

(C) ^STRANDEDNESS: both 

(D) ItopologY: linear 

(ii) MOLBcbliE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

\ 

(iv) ANTI-S^SE: NO 

\ 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCTGGGGACT TTCCAGGGAG GCGTGG 26 



(2) INFORMATION FpR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE:\ nucleic acid 

\ 
1 

J 



wo 96/17956 



PCTAJS95/15944 



69 



(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) ^OLECULE TYPE: CDNA 
(iii) SYPOTHBTICAL: NO 

(iV) ANTI-SENSE: NO 

(Xi) SEQTJENCE DESCRIPTION: SEQ ID NO: 16s 
GCTGGGGACT WtCCAGGGAG OCTGCC 

(2) INFORMATION FOR SEO ID N0sl7: 

(i) SE^OENCE CHARACTERISTICS: 

(1) LENGTH: 33 base pairs 
<b1) TYPE: nucleic acid 
(ci) STRANDEDNBSS : both 
(D) TOPOLOGY 5 linear 
i 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

i 

(iv) ANTll-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

\ 

TTTCCAGGGA GGCGTGGCCT GGGCGGGACT GGG 

\ 
\ 

(2) INFORMATION FOR SEO ID NO: 18: 

(i) SEQUElicE CHARACTERISTICS: 

(A) :4eNGTH: 33 base pairs 

(B) tJypE: nucleic acid 

(C) S^TRANDEDKESS: both 

(D) T|5P0L0GY: linear 

I 

(ii) MOLECULE TYPE: cDNA 

i 

(iii) HYPOTH^ICAL: NO 
(iv) ANTI-S^SE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO? 18: 

1 

CGTGGCCTGG GCGGGkCTGG GGAGTGGCGT CCC 

\ 
\ 

(2) INFORMATION I^R SEQ ID NO: 19: 

I 

(i) sequence! CHARACTERISTICS: 
(A) LENGTH: 45 base pairs 
(B> TYPE: nucleic acid 
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(C) STRANDEDNESS: both 
P) TOPOLOGY: linear 

{iiV MOLECULE TYPE: CDNA 

(iii)'i HYPOTHETICAL: NO 



(iv) Wti-sense: 



(iV) lANTI-SENSE: NO 

<xi) ksQUENCE DESCRIPTION: SEO ID NO;19j 

ctacaagggA ctttccgctg gggactttcc agggaggcgt ggcct 

<2) information for seq id no:20s 

(i) SEpUENCE CHARACTERISTICS s 
(A) LENGTH: 46 base pairs 
(k) TYPE: nucleic acid 

(Q) STRANDEDNESS : both 

(4) TOPOLOGY: linear 



(ii) MOLfeCULE TYPE: CDNA 
THETICAL: NO 



(ii) MOLECl] 
(iii) HYP^E 



(iv) ANT:)^ -SENSE: NO 

(xi) seqt^ce description; seq id no: 20: 
cagcaaggga ctttccgctg gggactttcc aggggaggtg tggcct 

\ 

(2) INFORMATI<iN FOR SEQ ID N0:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) pEKGTH: 46 base pairs 

(B) |rYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) ^OPOLOGY: linear 

(ii) MOLECx]|liE TYPE: cDNA 
(iii) HYPOT^TICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEOXJENC^ DESCRIPTION: SEQ ID NO: 21: 
CATCAAGGGA CTTTclpGCTG GGGACTTTCC AGGGGAGGTG TGGCCT 

\ 

(2) IISTFORMATION E^^R SEQ ID NO; 22: 

(i) SEQireiNCEi CHARACTERISTICS : 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE CTPE: CDKA 
(iki) HYPOTHETICAL: NO 
(iv) AKTI -SENSE: NO 



T 

(xi) SEOUENCE DESCRIPTION: SEQ ID NO :22s 
CAACAACffiGA CTTTCCGCTG GGGACTTTCC AGGGGAGGTG TGGCCT 



(2) INFORMATION FOR SEO I& NO: 23: 

i 

SE^tJENCE CHARACTERISTICS: 
1 (A) LENGTH: 45 base pairs 
\ <B) TYPE: nucleic acid 
\ (C> STRANDSDNSSS : both 

i (D) TOPOLOGY: linear 



(ii) ^MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

\ 

(iv) ANTI -SENSE: NO 

t 

(Xi) diEQUENCE DESCRIPTION: SEQ ID NO: 23; 
CTACAAGGGAI CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCAT 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 
(Ja) LENGTH: 44 base pairs 
{'b) TYPEt nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

\ 

(iii) HYPOTHETICAL: NO 

\ 
I 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CTACAAGGGA CTTTCCGCTG GGGACTTTCC GGGGAGCGTG GCCT 



\ 

12) INFORMATIO^r FOR SEQ ID NO: 25: 

\ 

(i) SEQUE^^CE CHARACTERISTICS: 
(A) LENGTH: 44 base pairs 



J 



wo 96/17956 • ^ PCT/US95/15944 



72 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(im ANTI -SENSE: NO 

<xi\ SEQUENCE DESCRIPTIONS SBQ ID NO;25 i 
CTACAAGQGA CTTTCCGCTG GGGACTTTCC GGGGAGGCGC GGCT 

(2) INPORMATION FOR SHQ ID NO: 26: 

(i) \sEQUEKCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

i(C} STRANDEDNESS: both 

WD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

\ 

(iii) HYPOTHETICAL: NO 
(iv> AK|!I -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

\ 

CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGAGAGGCGT GGACT 

i 

} 

(2) INFORMATION FOR SEQ ID NO: 27: 

i 

(i) SEQUENCE CHARACTERISTICS: 
(A)! LENGTH: 46 base pairs 
{B)l TYPE: nucleic acid 
{C)\ STRANDEDNESS: both 
( D ) I TOPOLOGY : linear 

(ii) MOLECSULE TYPE: cDNA 

i 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-pENSEt NO 

j 

(xi) SEQUEilCE DESCRIPTION: SEQ ID NO: 27: 
CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGGAGGCG TGGACT 



(2) INFORMATION I FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS : both 

(D) TOPOLOGY: linear 

<ii)\ MOLECULE TYPE: CDWA 
(iii) iHYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEOUENCE DESCRIPTION: SEQ ID NO: 28: 
CTACAGGGGa CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGGGAG 



(2) INFORMATION FOR SEO ID NO:29: 

(i) SEDUENCE CHARACTERISTICS: 

<A) LENGTH: 43 base pairs 
(^) TYPE: nucleic acid 
(G) STRANDEDNESS s both 
(Dl) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

\ 

(iii) HYPdjTHETICAL: NO 
(iv) AKTIrSENSE: NO 

(xi) SEQt^NCB DESCRIPTION: SEQ ID NO: 29: 

\ 

CTACAGGGGA CTTbPCCGCTG GGGACTTTCC AGGGAGGCTG CCT 

\ 

(2) INFORMATIOlk FOR SEQ ID NO: 30: 

(i) SEQXJEliCE CHARACTERISTICS: 

(A) i^gth: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

( ii ) MOLECThIe TYPE : cDNA 

i. 

<iii) HYPOTHETICAL: NO 

\ 

(iv) ANTI -SENSE: NO 

\ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

\ 

CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCCTGGGCG GGACTGGG 

■ 



(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE dpARACTERISTICS: 
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<A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
{D) TOPOlLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 31: 
TTTCCAGGGA GGCGTGGCCT GGGCGGGACT GGGGAGTGGC GTCCC 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCCTGGGCG GGACTGGGG 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPEt CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
TTTCCGCTGG GGACTTTCCA GGGAGGCGTG GCCTGGGCGG GACTGGGGAG TGGCGTCCC 



(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION : SEO ID NO: 34; 
CTACAAGGGA CTTTCCGCTC GGGACTTTCC AGGGAGGCGT GGCCTGGGC6 GGACTGGGGA 
GTCGCGTCCC 



(2) INFORMATION FOR SBQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TATCACCGCC AGTGGTATTT ATGTCAACAC CGCCAGAGAT AATTTATCAC CGCAGATGGT 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

( D ) TOPOLOGY : linear 

<ii) MOLECUXiE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
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TATCACCGCA AGGGATMAT ATCTAACACC GTGCGTGTTG ACTATTTTAC CTCTGGCGGT 



60 



GATA 



64 



(2) INFORMATION FOR SBQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDKA 
(iii> HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
CTACAAGGGA CTTTCCGCTG GGG&CTTTCC ACGGAGGCGT GGCCTG6GCG GG&CTGGGGA 60 
GTGGCGTCCC 70 



(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 37 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE s NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGG 37 



<2> INFORMATION FOR SEQ ID NO: 39: 

(1) SEQUENCE CHARACTERISTICS: 

(A) I.ENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOiiOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(iii) HYPOTHETICAL: NO 
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(iv) ANTl -SENSES NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO :3d: 



C6G6ACTGGG GAGT6GCGTC CO 



22 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 base pairs 
(B> TYPE: nucleic acid 
<C) STRANDSDNESS : both 
<D) TOPOLOG?: linear 

{ii> MOIiECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

<iv} ANTI-SENSE: NO 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGTAT CACCGCCAGT GGTATTTATG 60 
TCAACACCGC CAGAGATAAT TTATCACCGC AGATGGTTCT GCA 103 



{2) IKPORMATION FOR SSQ ID NO: 41: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY; linear 

(ii) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GAACCATCTG CGGTGATAAA TTATCTCTGG CGGTGTTOAC ATAAATACCA CTGGCGGTGA 60 



(2) INFORMATION FOR SEQ ID NO: 42: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 71 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



TA 



62 
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(ii) MOLECULE TYPE: CDNA 



(iii) HYPOTHETICAL: NO 



(iv) AKTI-SENSE: NO 



<xi> SEQtJENCE DESCRXPTtOWs SEQ ZD NO: 42: 



GATCCAACCA TCTGCGGTGA TAAATTATCT CTGGCGGTGT TGACATAAAT ACCACTGOCG 



60 



GTGATACTGC A 



71 



(2) IKFOItMATlOia FOR SSO ID WO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE I cDKA 

(iii) HYPOTHETICAL: NO 



(iv) AKTI -SENSE: NO 

(Xi) SEOUENCE DESCRIPTION: SEQ ID NO: 43: 
GTATCACCGC CAGTGGTATT TATGTCAACA CCGCCAGAGA TAATTTATCA CCGCAGATGG 60 
TTG 63 



(2) INFORMATION FOR SEQ ID NO: 44: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SSNSEl NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GATCCGGGGG GATACCCCCC G 



(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL s NO 
(iv) ANTI -SENSES NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NOi45: 
CGGGACTGGG GAGTGGCGTC CCTATCACCG CAAGGGATAA ATATCTAACA CCGTGCGTGT 60 
TGACTATTTT ACCTCTGGCG GTGATAGCAT G 91 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
CTAAGGGCGT AACCGAAATC GGTTGAACCG AAACCGGTTA GTATAAAAGC AGA 53 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iV) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
AAAAGGGAGT AACCGAAAAC GGTCGGGACC GAAAACGGTG TATATAAAAG ATGT 54 

(2) INFORMATION FOR SEQ ID NO: 48: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) IJSNGa?Ht 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRAN0EDMESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTX-SEKSE: KO 

(xi) SBftUENCE DESCRIPTION: SEQ ID NO: 48: 
AGTAGGGTGT AACCGAAAGC GGTTCAACCG AAAACGGTGC ATATATAAAG CAAA 54 



(2) INFORMATION TOR SEQ ID NO:49: 

(i) SEQUEfrCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; both 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: KO 
(iV) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
GCTTCAACCG AATTCGGTTG CATG 24 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY! linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: KO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 
TGTGCAACCG ATTTCGGTTG CCTT 24 



(2) INFORMATION FOR SEQ ID NO: 51: 
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(i) SEQUENCE CHARACTBRISTICS: 
(A) IiBNGTH: 24 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNSSS : both 

(D) TOPOLOGY: linear 

(ii) MOLECUIiE TYPK: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) AKTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
TATGCAACCG AAATAGGTTG GGCA 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
TGCCTAACCG TTTTCGGTTA CTTG 



(2) INFORMATION FOR SEQ ID NO: 53: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



GGACTAACCG TTTTAGGTCA TATT 
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(2) INFORMATION FOR SEO ID NO: 54: 

(i) SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
GACGACTATC CAGCGACCAA GATCAGAGCC AGACACCGGA AACCCCTGCC AC 



(2) INFORMATION FOR SEQ ID NO: 55: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; cDNA 
(iii) HYPOTHETICAL: NO 
(iv> ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
GAC6ACACGG TATCCGCTAC TCAGCTTGTT AAACAGCTAC AGCACACCCC CTC 



(2) INFORMATION FOR SEQ ID NO: 56: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
6ACGACGACC TGCAGACACC ACAGACACCG CCCAGCCCCT TACAAAGCTG TTCTGTGCAG 
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(2) INFORMATION FOR SEQ ID NO: 57: 
(i) SBOtJENCE CHARACOfERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOI^OGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(Xi) SEOOENCE DESCRIPTION: SEQ ID NO: 57: 
CATACCAAAG CCGTC6CCTT GG6CACCGAA GAAACACAAC CACTAAGTTG TTGCACAGAG 60 
ACTCAGTG $8 



(2) INFORMATION FOR SEQ ID NO: 58: 

<i) SEQXJBNCE CHARACTERISTICS: 
<A) LENGTH: 77 base pairs 
(B) TYPE: nucleic acid 

(C> STE5ANDBDNESS : both 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

<iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
TAATGTAATT GATTGTAATG ACTCTATGTG CAGTACCAGT ACCGTATTCC AGCACCGTGT 60 
CCGTGGGCAC CGCAAAG 77 



(2> INFORMATION FOR SEQ ID NO: 59: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOX.OGy; linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59; 
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ACAGACAACG ATAACCGACC ACCACAAGCA GCGGCCAAAC ACCCCGCCTT GGACAATAGA 60 
ACA6CACGTA CTGCAACTAA 80 



<2) INFORMATION FOR SEQ ID NO:60j 

<i) SEQUENCE CHARACTERISTICS: 

<A) IiENGTH: 26€ base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 
(ii) MOI»ECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTX-SENSEt NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



CATATGCAAT ACAATGCATT ATACAAACTG GACACATATA TATATTTGTG AAGAAGCATC 60 

AGTAACTGTG GTAGAGGGTC AAGTTGACTA TTATGGTTTA TATTATGTTC ATGAAGGAAT 120 

ACGAACATAT TTTOTGCAGT TTAAAQATGA TOCAGAAAAA TATAGTAAAA ATAA&GTATG 180 

GGAAGTTCAT GCGGGTGGTC AGGTAATATT ATGTCCTACA TCTGTGTTTA GCAGCAACGA 240 

AGTATCCTCT CCTGAAATTA TTAGGC 266 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CH2^CTERISTICS: 

(A) IiENGTH: 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; both 

(D) TOPOLOGY* linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
AGGATGTATA AAAAAACATG GATATACAGT GGAAGTGCAG TTTGATGGAG ACATATGCTA 60 
TTAGGCAGCA CTTGGCCAAC CACCCCGCCG CGACC 95 



(2) INFORMATION FOR SEQ ID NO: 62: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TY?Es nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGX: linear 

(ii) MOLECUI.E TYPE: CDNA 
(iii> HYPOTHETICAL: NO 
{iv> ANTI-SENSE: NO 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
CATGTTTTTT TATACATCCA TATCACCGCC AGTGGTATTT ATGTCAACAC CGCCAGAGAT 
AATTTATCAC CGCAGATGGT T 



(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 

{xi> SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Met Ala Asp Asp Asp Pro Tyr Gly Thr Gly Gin Met Phe His Leu Asn 
1 5 10 15 

Thr Ala I»eu Thr His ser lie Phe Asn Ala Glu Leu Tyr Ser Pro Glu 
20 25 30 

lie Pro Leu ser Thr Asp Gly Pro Tyr Leu Gin lie Leu Glu Gin Pro 
35 40 45 

Lys Gin Arg Gly Phe Arg Phe Arg Tyr Val Cys Glu Gly Pro Ser His 
50 55 60 

Gly Gly Leu Pro Gly Ala Ser ser Glu Lys Asn Lys Lys Ser Tyr Pro 
65 70 75 80 

Gin Val Lys lie Cys Asn Tyr Val Gly Pro Ala Lys Val lie Val Gin 
85 90 95 

Leu Val Thr Asn Gly Lys Asn lie His Leu His Ala His Ser Leu Val 
100 105 110 
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Gly Lys His Cys Glu Asp Gly Val cys Thr val Thr Ala Gly Pro Lys 
115 120 125 

Asp Met Val Val Gly Phe Ala Asn Leu Gly He Leu His Val Thr Lys 

130 135 140 

Lys Lys Val Phe Glu Thr Leu Glu Ala Arg Met Thr Glu Ala Cys lie 
145 150 155 160 

Arg Gly Tyr Asn Pro Gly Leu Leu Val His ser Asp Leu Ala Tyr Leu 
165 170 175 

Gin Ala Glu Gly Gly Gly Asp Arg Gin Leu Thr Asp Arg Glu Lys Glu 
ISO 185 190 

He He Arg Gin Ala Ala val Gin Gin Thr Lys Glu Met Asp Leu Ser 
195 200 205 

val val Arg Leu Met Phe Thr Ala Phe Leu Pro Asp Ser Thr Gly Ser 
210 215 220 

Phe Thr Arg Arg Leu Glu Pro Val Val Ser Asp Ala He Tyr Asp Ser 
225 230 235 240 

Lys Ala Pro Asn Ala Ser Asn Leu Lys He val Arg Met Asp Arg Thr 
245 250 255 

Ala Gly cys val Thr Gly Gly Glu Glu He Tyr Leu Leu cys Asp Lys 
260 265 270 

val Gin Lys Asp Asp He Gin He Arg Phe Tyr Glu Glu Glu Glu Asn 
275 280 285 

Gly Gly val Trp Glu Gly Phe Gly Asp Phe ser Pro Thr Asp Val His 
290 295 300 

Arg Gin Phe Ala He Val Phe Lys Thr Pro Lys Tyr Lys Asp Val Asn 
305 310 315 320 

He Thr 



(2) INFORMATION FOR SEO ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 325 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii> HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(V) FRAGMENT TYPE: internal 



^1 

o 
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(xi) SEQUENCE DESCiaPTION: SEQ ID NO: 64: 

Met Ala Glu Asp Asp Pro Tyr Leu Gly Arg Pro Glu Gin Met Phe His 
1 S 10 15 

Leu Asp Pro Ser Leu Thr His Thr lie Phe Asn Pro Glu Val Phe Gin 
20 25 30 



Pro Gin Met Ala Leu Pro Thr Ala Asp Gly Pro Tyr Leu Gin He Leu 
35 40 45 

Glu Gin Pro Lys Gin Arg Gly Phe Arg Phe Arg Tyr Val Cys Glu Gly 
50 55 60 

Pro Ser His Gly Gly Leu Pro Gly Ala ser ser Glu Lys Asn Lys Lys 
65 70 75 80 

Ser Tyr Pro Gin Val Lys He Cys Asn Tyr Val Gly Pro Ala Lys Val 
85 90 95 

He Val Gin Leu Val Thr Asn Gly Lys Asn He His Leu His Ala His 
100 105 110 

Ser Leu Val Gly Lys His Cys Glu Asp Gly lie Cys Thr Val Thr Ala 
115 120 125 

Gly Pro Glu Asp cys val His Gly Phe Ala Asn Leu Gly He Leu His 
130 135 140 

Val Thr Lys Lys Lys Val Phe Glu Thr Leu Glu Ala Arg Met Thr Glu 
145 150 155 160 

Ala cys He Arg Gly Tyr Asn Pro Gly Leu Leu Val His Pro Asp Leu 
165 170 175 

Ala Tyr Leu Gin Ala Glu Gly Gly Gly Asp Arg Gin Leu Gly Asp Arg 
180 185 190 

Glu Lys Glu Leu lie Arg Gin Ala Ala Leu Gin Gin Thr Lys Glu Met 
195 200 205 

Asp Leu Ser Val Val Arg Leu Met Phe Thr Ala Phe Leu Pro Asp Ser 
210 215 220 

Thr Gly Ser Phe Thr Arg Arg Leu Glu Pro Val Val Ser Asp Ala He 
225 230 235 240 

Tyr Asp Ser Lys Ala Pro Asn Ala Ser Asn Leu Lys He Val Arg Met 

245 250 255 

Asp Arg Thr Ala Gly cys Val Thr Gly Gly Glu Glu He Tyr Leu Leu 
260 265 270 



Cys Asp Lys Val Gin Lys Asp Asp He Gin He Arg Phe Tyr Glu Glu 
275 280 285 
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Giu Glu Asn Gly Gly val Trp Glu Gly Phe Gly Asp Phe Ser Pro Thr 
290 295 300 

Asp val His Arg Gin Phe Ala lie Val Phe Lys Thr Pro Lys Tyr Lys 
305 310 315 320 

Asp lie Asn lie Thr 
325 



(2) INFORMATION FOR SZQ ID NOt65: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) XiENGTH: 268 amino acids 

(B) TYPE: amino acid 
{ D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

{V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Met Glu Pro Ala Asp Leu Leu Pro Leu Tyr Leu Gin Pro Glu Trp Gly 
15 10 15 

Glu Gin Glu Pro Gly Gly Ala Thr Pro Phe Val Glu lie Leu Glu Gin 
20 25 30 

Pro Lys Gin Arg Gly Met Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser 
35 40 45 

Ala Gly Ser lie Pro Gly Glu His ser Thr Asp Ser Ala Arg Thr His 
50 55 60 

Pro Thr lie Arg Val Asn His Tyr Arg Gly Pro Gly Arg Val Arg Val 
65 70 75 80 

Ser Leu Val Thr Lys Asp Pro Pro His Gly Pro His Pro His Glu Leu 
85 90 95 

Val Gly Arg His Cys Gin His Gly Tyr Tyr Glu Ala Glu Leu Ser Pro 
100 105 110 

Asp Arg Ser lie His Ser Phe Gin Asn Leu Gly lie Gin Cys Val Lys 
115 120 125 

Lys Arg Glu Leu Glu Ala Ala Val Ala Glu Arg lie Arg Thr Asn Asn 
130 135 140 

Asn Pro Phe Asn Val Pro Met Glu Glu Arg Gly Ala Gin Tyr Asp Leu 
145 150 155 160 
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ser Ala val Arg Leu cys Phe Gin val Trp Val Asn Gly Pro Gly Gly 
165 170 175 

Leu Cys Pro Leu Pro Pro val Leu ser Gin Pro lie Tyr Asp Asn Arg 
180 185 190 

Ala Fro Ser Thr Ala Glu Leu Arg He Leu Pro Gly Asp Arg Asn ser 
195 200 205 

Gly Ser Cys Gin Gly Gly Asp Glu He Phe Leu Leu Cys Asp Lys val 
210 215 220 

Gin Lys Glu Asp He Glu Val Arg Phe Trp Ala Glu Gly Trp Glu Ala 
225 230 235 240 

Lys Gly ser Phe Ala Ala Ala Asp Val His Arg Gin Val Ala He Val 
245 250 255 



Phe Arg Thr Pro Pro Phe Arg Glu Arg Ser Leu Arg 
260 265 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: aiaino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Met Asp Asp Leu Phe Pro Leu He Phe Pro ser Glu Pro Ala Gin Ala 
1 5 10 15 

ser Gly Pro Tyr Val Glu He He Glu Gin Pro Lys Gin Arg Gly Met 
20 25 30 

Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser Ala Gly Ser He Pro Gly 
35 40 45 

Glu Arg Ser Thr Asp Thr Thr Lys Thr His Pro Thr He Lys He Asn 
50 55 60 

Gly Tyr Thr Gly Pro Gly Thr Val Arg lie Ser Leu Val Thr Lys Asp 

65 70 75 80 
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pro Pro His Arg Pro His Pro Sis Glu I.eu val Gly Lys Asp Cys Arg 
85 90 95 

Asp Gly !ryr Tyr Glu Ala Asp Leu cys Pro Asp Arg ser lie His Ser 

100 105 

Phe Gin Asn Leu Gly He Gin cys val Lys Lys Arg Asp Leu Glu Gin 
115 120 125 

Ala He ser Gin Arg He Gin Thr Asn Asn Asn Pro Phe His Val Pro 
130 135 140 

lie Glu Glu Gin Arg Gly Asp Tyr Asp Leu Asn Ala Val Arg Leu Cys 
145 150 155 160 

Phe Gin Val Thr Val Arg Asp Pro Ala Gly Arg Pro Leu Leu Leu Thr 
165 170 175 

Pro Val Leu ser His Pro He Phe Asp Asn Arg Ala Pro Asn Thr Ala 
180 185 190 

Glu Leu Lys He Cys Arg val Asn Arg Asn Ser Gly ser cys Leu Gly 
195 200 205 

Gly Asp Glu He Phe Leu Leu Cys Asp Lys Val Gin Lys Glu Asp He 
210 215 220 

Glu val Tyr Phe Thr Gly Pro Gly Trp Glu Ala Arg Gly Ser Phe Ser 
225 230 235 240 

Gin Ala Asp Val His Arg Gin val Ala He val Phe Arg Thr Pro Pro 
245 250 255 

Tyr Ala Asp Pro ser Leu Gin 
260 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEOtlEMCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v> FRAGMENT TYPE: internal 

{Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Met Asp Glu Leu Phe Pro Leu He Phe Pro Ala Glu Pro Ala Gin Ala 



1 
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15 



W096/17956 - ' PCr/US95/15944 



91 

Ser Gly Pro Tyr Val Glu lie lie Giu Gin Pro Lys Gin Arg Gly Met 

20 25 30 

Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser Ala Gly Ser lie Pro Gly 
35 40 45 

Glu Arg Ser Thr Asp Thr Thr I.ys Thr His Pro Thr lie Lys lie Asn 
50 55 60 

Gly Tyr Thr Gly Pro Gly Thr Val Arg He ser Leu Val Thr Lys Asp 
65 70 75 80 

Pro Pro Bis Arg Pro His Pro His Glu Leu Val Gly Lys Asp cys Arg 
85 90 95 

Asp Gly Phe Tyr Glu Ala Glu Leu cys Pro Asp Arg Cys He His Ser 
100 105 110 

Phe Gin Asn Leu Gly He Gin Cys Val Lys Lys Arg Asp Leu Glu Gin 
115 120 125 

Ala lie ser Gin Arg lie Gin Thr Asn Asn Asn Pro Phe Gin val Pro 
130 135 140 

He Glu Glu Gin Arg Gly Asp Tyr Asp Leu Asn Ala Val Arg Leu cys 
145 150 155 160 

Phe Gin val Thr val Arg Asp Pro ser Gly Arg Pro Leu Arg Leu Pro 
165 170 175 



Pro val Leu Pro His Pro He Phe Asp Asn Arg Ala Pro Asn Thr Ala 
180 185 190 

Glu Leu Lys He cys Arg Val Asn Arg Asn ser Gly ser Cys Leu Gly 
195 200 205 

Gly Asp Glu lie Phe Leu Leu Cys Asp Lys val Gin Lys Glu Asp He 
210 215 220 

Glu Val Tyr Phe Thr Gly Pro Gly Trp Glu Ala Arg Gly ser Phe ser 
225 230 235 240 

Gin Ala Asp Val His Arg Gin val Ala He val Phe Arg Thr Pro Pro 
245 250 255 

Tyr Ala Asp Pro ser Leu Gin 
260 



(2) INFORMATION FOR SEQ ID NO: 68 J 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 299 amino acids 

(B) TYPE: aznino acid 
(D) TOPOLOGY: linear 



wo 96/17956 




PCT/US95/15944 



<ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 68: 

Met Phe Pro Asn Gin Asn Asn Gly Ala Ala Pro Gly Gin Giy Pro Ala 
1 5 10 15 

Val Asp Gly Gin Gin Ser ten Asn Tyr Asn Gly Leu Pro Ala Gin Gin 



Gin Gin Gin Leu Ala Gin Ser Thr Lys Asn Val Arg Lys Lys Pro Tyr 
35 40 45 

Val Lys lie Thr Glu Gin Pro Ala Gly Lys Ala Leu Arg Phe Arg Tyr 
50 55 60 

Glu cys Glu Gly Arg ser Ala Gly ser lie Pro Gly val Asn ser Thr 
65 70 75 80 

Pro Glu Asn Lys Thr Tyr Pro Thr He Glu He Val Gly Tyr Lys Gly 
85 90 95 

Arg Ala Val Val val Val ser Cys Val Thr Lys Asp Thr Pro Tyr Arg 

100 105 110 

Pro His Pro His Asn Leu Val Giy Lys Glu Gly Cys Lys Lys Gly Val 
115 120 125 

Cys Thr Leu Glu He Asn ser Glu Thr Met Arg Ala Val Phe Ser Asn 
130 135 140 

Leu Gly He Gin cys val Lys Lys Lys Asp He Glu Ala Ala Leu Lys 
145 150 155 160 

Ala Axg Glu Glu He Arg Val Asp Pro Phe Lys Thr Gly Phe Ser His 
165 170 175 

Arg Phe Gin Pro Ser Ser He Asp Leu Asn ser Val Arg Leu Cys Phe 
180 185 190 

Gin Val Phe Met Glu Ser Glu Gin Lys Gly Arg Phe Thr Ser Pro Leu 
195 200 205 

Pro Pro val Val ser Glu Pro He Phe Asp Lys Lys Ala Met Ser Asp 
210 215 220 

Leu Val He Cys Arg Leu Cys Ser Cys Ser Ala Thr Val Phe Gly Asn 
225 230 235 240 

Thr Gin He He Leu Leu Cys Glu Lys Val Ala Lys Glu Asp He Ser 



20 



25 



30 



245 



250 



255 
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Val Arg Phe Phe Glu Glu Lys Asn Gly Gin Ser Val Trp Glu Ala Phe 
260 265 270 

Gly Asp Phe Gin His Thr Asp val His Lys Gin Thr Ala lie Thr Phe 
275 280 285 

Lys Thr Pro Arg Tyr His Thr Leu Asp He Thr 
290 295 



(2) INFORMATION FOR SEQ ID NO:69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 amino acids 

(B) TYPE: amino acid 
<D) topology: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Met Asp Phe Leu Thr Asn Leu Arg Phe Thr Glu Gly He ser 
15 10 

Tyr He Glu He Phe Glu Gin Pro Arg Gin Arg Gly Thr Arg 
20 25 30 

Tyr Lys Cys Glu Gly Arg ser Ala Gly Ser He Pro Gly Glu 
35 40 45 

Thr Asp Asn Asn Lys Thr Phe Pro ser He Gin He Leu Asn 
50 55 60 



Glu Pro 
15 

Phe Arg 
His Ser 
Tyr Phe 



Gly Lys Val Lys He Arg Thr Thr Leu Val Thr Lys Asn Glu Pro Tyr 
65 70 75 80 

Lys Pro His Pro His Asp Leu val Gly Lys Gly Cys Arg Asp Gly Tyr 
85 90 95 

Tyr Glu Ala Glu Phe Gly Pro Glu Arg Gin Val Leu Ser Phe Gin Asn 
100 105 110 

Leu Gly He Gin Cys Val Lys Lys Lys Asp Leu Lys Glu ser He Ser 

115 120 125 



Leu Arg 
130 



He Ser Lys Lys Asn Pro Phe Asn Val Pro Glu Glu 
135 140 



Gin Leu 
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His Asn lie Asp GXu Tyr Asp Leu Asn Val Val Arg Leu cys Phe Gin 

145 150 155 160 

Ala Phe Leu Fro Asp Glu His Gly Asn Tyr Thr Leu Ala Leu Pro Pro 
165 170 175 

Leu lie ser Asn Pro lie Tyr Asp Asn Arg Ala Pro Asn Thr Ala Glu 
180 185 190 

Leu Arg lie Cys Arg Val Asn Lys Asn Cys Gly Ser val Lys Gly Gly 
195 20O 205 

Asp Glu Xle Phe Leu Leu Cys Asp Lys Val Gin Lys Asp Asp He Glu 
210 215 220 

Val Arg Phe Val Leu Gly Asn Trp Glu Ala Lys Gly Ser Phe Ser Gin 
225 230 235 240 

Ala Asp val His Arg Gin Val Ala He val Phe Arg Thr Pro Pro Phe 
245 250 255 

Leu Gly Asp He Thr 
260 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; peptide 

(iii) HYPOTHETICAL: NO 
{±V) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Met Asp Phe Leu Thr Asn Leu Arg Phe Thr Glu Gly He Ser Glu Pro 
1 5 10 15 

Tyr He Glu He Phe Glu Gin Pro Arg Gin Arg Gly Met Arg Phe Arg 
20 25 30 

Tyr Lys cya Glu Gly Arg Ser Ala Gly ser He Pro Gly Glu His Ser 
35 40 45 

Thr Asp Asn Asn Lys Thr Phe Pro Ser He Gin He Leu Asn Tyr Phe 
50 55 60 

Gly Lys Val Lys He Arg Thr Thr Leu Val Thr Lys Asn Glu Pro Tyr 
65 70 75 80 
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Lys Pro His Pro His Asp Leu Val Gly Lys Gly Cys Arg Asp Gly Tyr 
85 90 95 

Tyr Glu Ala Glu Phe Gly Pro Glu Arg Gin Val Leu Ser Phe Gin Asn 
100 105 110 

Leu Gly He Gin Cys Val Lys Lys Lys Asp Leu Lys Glu ser He Ser 
115 120 125 

Leu Arg He Ser Lys Lys lie Asn Pro Phe Asn Val Pro Glu Glu Gin 
130 135 140 

Leu His Asn He Asp Glu Tyr Asp Leu Asn Val Val Arg Leu Cys Phe 
145 150 155 160 

Gin Ala Pile Leu Pro Asp Glu His Gly Asn Tyr Thr Leu Ala Leu Pro 
165 170 175 

Pro Leu He Ser Asn Pro He Tyr Asp Asn Arg Ala Pro Asn Thr Ala 
180 185 190 

Glu Leu Arg He Cys Arg Val Asn Lys Asn cys Gly Ser Val Lys Gly 
195 200 205 

Gly Asp Glu He Phe Leu Leu Cys Asp Lys Val Gin Lys Asp Asp He 
210 215 220 

Glu val Arg Phe Val Leu Gly Asn Trp Glu Ala Lys Gly Ser Phe Ser 
225 230 235 240 

Gin Ala Asp Val His Arg Gin Val Ala He Val Phe Arg Thr Pro Pro 
245 250 255 

Phe Leu Gly Asp He Thr 
260 



(2) INFOHMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECtJLE TYPE; peptide 

(iii) HYPOTHETICAL: NO 

<iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

<xi) SEQtJrENCE DESCRIPTION: SEQ ID NO: 71: 



Met Ser Asn Lys Lys Gin Ser Asn Arg Leu Thr Glu Gin His Lys Leu 
15 10 15 
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ser Gin Gly Val He Gly lie Phe Gly Asp Tyr Ala Lys Ala His Asp 

20 25 30 

Leu Ala Val Gly Glu Val ser Lys Leu val Lys Lys Ala Leu Ser Asn 
35 40 45 

Glu Tyr Pro Gin Leu ser Phe Arg Tyr Arg Asp ser He Lys Lys Thr 
50 55 60 

Glu He Asn Glu Ala Leu Lys Lys He Asp Pro Asp Leu Gly Gly Thr 
65 70 75 80 

Leu Phe Val Ser Asn Ser ser He Lys Pro Asp Gly Gly He Val Glu 
85 50 95 

Val Lys Asp Asp Tyr Gly Glu Trp Arg Val Val Leu Val Ala Glu Ala 
100 105 110 

Lys His Gin Gly Lys Asp lie lie Asn lie Arg Asn Gly Leu Leu Val 
115 120 125 

Gly Lys Arg Gly Asp Gin Asp Leu Met Ala Ala Gly Asn Ala He Glu 
130 135 140 

Arg ser His Asn He Ser Glu He Ala Asn Phe Met Leu ser Glu ser 
145 150 155 160 

His Phe Pro Tyr Val Leu Phe Leu Glu Gly ser Asn Phe iieu Thr Glu 
165 170 175 

Asn He Ser He Thr Arg Pro Asp Gly Arg val Val Asn Leu Glu Tyr 



Ser Asn Phe Leu Thr Glu Asn He ser He Thr Arg Pro Asp Gly Arg 
210 215 220 

Val Val Asn Leu Glu Tyr Asn Ser Gly lie Leu Asn Arg Leu Asp Arg 
225 230 235 240 

Leu Thr Ala Ala Asn Tyr Gly Met Pro He Asn Ser Asn Leu cys He 
245 250 255 

Asn Lys Phe Val Asn His Lys Asp Lys ser He Met Leu Gin Ala Ala 
260 265 270 

Ser lie Tyr Thr Gin Gly Asp Gly Arg Glu Trp Asp Ser Lys He Met 
275 280 285 

Phe Glu He Met Phe Asp He Ser Thr Thr ser Leu Arg val Leu Gly 
290 295 300 



180 



185 



190 



Asn Ser Gly Ser Glu Ser His Phe Pro Tyr val Leu Phe Leu Glu Gly 
195 200 205 



Arg Asp Leu Phe Glu Gin Leu Thr Ser Lys 
305 310 
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(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I.ENGTH: 17 ainino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOIiECUI^E TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

cys Asp Thr Asp Asp Arg His Arg lie Glu Glu Lys Arg Lys Arg Lys 
1 5 10 15 

Thr 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v> FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Gly Asp Pro Gly Lys Lys Lys Gin His He Cys His He Gin Gly Cys 
15 10 15 

Gly Lys Val Tyr Gly Lys Thr Ser His Leu Arg Ala His Leu Arg Trp 
20 25 30 

His Thr Gly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys Gly Lys 
35 40 45 

Arg Phe Thr Arg Ser Asp Glu Leu Gin Arg His Lys Arg Thr His Thr 
50 55 60 

Gly Glu Lys Lys Phe Ala Cys Pro Glu cys Pro Lys Arg Phe Met Arg 
65 70 75 80 
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ser Asp His Leu Ser Lys His lie Lys Thr His Gin Asn Lys Lys Gly 
B5 90 95 

Gly Pro Gly Val Ala Leu Ser Val Gly Thr Leu Pro Leu Asp Ser Gly 
100 105 110 

Ala Gly ser Glu Gly Ser Gly Tiir Ala Thr Pro ser Ala Leu lie Thr 
115 120 125 

Thr Asn Met Val Ala Met Glu Ala lie cys Pro Glu Gly lie Ala Arg 
130 135 140 

Leu Ala Asn Ser Gly lie Asn val wet Gin Val Ala Asp Leu Gin Ser 
145 150 155 160 

lie Asn lie ser Gly Asn Gly Phe 
165 



(2) INFORMATION FOR SEQ ID l?0:74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 181 amino acids 
(B> TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL s NO 

(iv> ANTl -SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74s 

Ser Gly lie Val Pro Gin Leu Gin Asn lie Val ser Thr Val Asn Leu 

. 1 5 10 15 

Gly Cys Lys Leu Asp lieu Lys Thr lie Ala Leu Arg Ala Arg Asn Ala 
20 25 30 

Glu Tyr Asn Pro Lys Arg Phe Ala Ala Val lie Met Arg He Arg Glu 
35 40 45 

Pro Arg Thr Thr Ala Leu He Phe ser ser Gly Lys Met Val Cys Thr 
50 55 60 

Gly Ala Lys Ser Glu Glu Gin ser Arg Leu Ala Ala Arg Lys Tyr Ala 
65 70 75 80 

Arg Val Val Gin Lys Leu Gly Phe Pro Ala Lys Phe Leu Asp Phe Lys 
85 90 95 

He Gin Asn Met: val Gly Ser Cys Asp val Lys Phe Pro He Arg Leu 



100 



105 



110 
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Glu Gly Leu val Leu Thr His Gin Gin Phe ser ser Tyr Glu Pro Glu 
115 120 125 

Leu Phe Pro Gly Leu lie Tyr Arg Met lie Lys Pro Arg He val Leu 
130 135 140 



L8U He Phe Val Ser Gly Lys Val Val Leu Thr Gly Ala Lys Val Arg 
145 150 155 160 

Ala Glu He Tyr Glu Ala Phe Glu Asn He Tyr Pro He Leu Lys Gly 
165 170 175 

Phe Arg Lys Thr Thr 
ISO 



(2) INFORMATION FOR SEQ XD NO: 75: 

(1) SEQUENCE CHARACCTRISTXCS: 
(A> LENGTH: 85 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPES internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ser Cys Phe Ala Leu He Ser Gly Thr Ala Asn Gin Val Lys Cys Tyr 
1 5 10 15 

Arg Phe Arg Val Lys Lys Asn His Arg His Arg Tyr Glu Asn Cys Thr 
20 25 30 

Thr Thr Trp Phe Thr Val Ala Asp Asn Gly Ala Glu Arg Gin Gly Gin 
35 40 45 

Ala Gin He Leu He Thr Phe Gly ser Pro ser Gin Arg Gin Asp Phe 
50 55 60 

Leu Lys His Val Pro Leu Pro Pro Gly Met Asn He Ser Gly Phe Thr 
65 70 75 80 

Ala Ser Leu Asp Phe 
85 



{2} INFORMATION FOR SEQ ID K0:76: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: HO 

(iv) AKTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

cys Pro cys Leu Leu lie Gly Thr Ser Giy ksn Gly Asn Gin Val Lys 

15 10 15 



Cys Tyr Ser Phe Arg Val Lys Arg Trp His Asp Arg Asp Lys Tyr His 
20 • 25 30 

His Thr Thr Thr Trp Trp Ala Val Gly Gly Gin Gly Ser Glu Arg Pro 
35 40 45 

Gly Asp Ala Thr Val He Val Thr Phe Lys Asp Gin ser Gin Arg ser 
50 55 60 

His Phe Leu Oln Gin val Pro Leu Pro Pro Gly Met ser Ala His Gly 
65 70 75 80 

val Thr Met Thr Val Asp Phe 
85 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECtJLE TYPE: peptide 

(iii) HYPOTHETICAL: HO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Pro Pro val He cys Leu Lys Gly Gly His Asn Gin Leu Lys Cys Leu 
1 5 10 15 



Arg Tyr Arg Leu Lys Ser Lys His Ser Ser Leu Phe Asp Cys He Ser 
20 25 30 
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Thr Thr Trp Ser Trp Val Asp Thr Thr ser Thr Cys Arg i»eu Gly Ser 
35 40 45 

Gly Arg Met Leu lie Lys Phe Ala Asp ser Glu Gin Arg Asp Lys Phe 
50 55 60 

Leu Ser Arg val Pro tieu Pro ser Thr Thr Gin val Phe Leu Gly Asn 
65 70 75 80 

Phe Tyr Gly Leu 



(2) INFORMATION FOR SEQ ID NO;78: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Pro Pro Val He Leu Val Arg Gly Gly Ala Asn Thr Leu Lys Cys Phe 
1 5 10 15 

Arg Asn Arg Ala Arg Val Arg Tyr Arg Gly Leu Phe Lys Tyr Phe Ser 
20 25 30 

Thr Thr Trp ser Trp val Ala Gly Asp Ser Thr Glu Arg Leu Gly Arg 
35 40 45 

ser Arg Met Leu He Leu Phe Thr Ser Ala Cys Gin Arg Glu Lys Pro 
50 55 60 

Asp Glu Thr Val Lys Tyr Pro Lys Gly Val Asp Thr Ser Tyr Gly Asn 
65 70 75 80 

Leu Asp Ser Leu 



(2) INFORMATION FOR SEQ ID NO: 79: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPS: peptide 
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(iii) HYPOTHETICAL; NO 
(iv) ANTI -SENSE: NO 
(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NOi79: 

Pro Pro Val Val Cya val Lys Gly Gly Ala Asn Gin Leu Lys Cys Leu 
1 5 10 15 

Arg Tyr Arg Leu Lys Ala Ser Thr Gin Val Asp Phe Asp Ser He ser 
20 25 30 

Thr Thr Trp His Trp Thr Asp Arg Lys Asn Thr Glu Arg He Gly Ser 
35 40 45 

Ala Arg Met Leu Val Lys Phe lie Asp Glu Ala Gin Arg Glu Lys Phe 
50 55 60 

Leu Glu Arg Val Ala Leu Pro Arg ser Val Ser Val Phe Leu Gly Gin 
65 70 75 80 

Phe Asn Gly Ser 



(2> INFORMATION FOR SEQ ID NO:80f 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 aittino acids 

(B) TYPE: aanino acid 
(D) TOPOLOGY: lineaur 

(ii) MOLECULE TYBE: peptide 

(iii) HYPOTHETICAL: NO 

<iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Thr Pro lie Val Gin Leu Gin Gly Asp Ser Asn Cys Leu Lys Cys Phe 
15 10 15 

Arg Tyr Arg Leu Asn Asp Lys Tyr Lys His Leu phe Glu Leu Ala Ser 
20 25 30 

Ser Thr Trp His Tarp Ala Ser Pro Glu Ala Pro His Lys Asn Ala lie 
35 40 45 

Val Thr Leu Thr Tyr Ser Ser Glu Glu Gin Arg Gin Gin Phe Leu Asn 
50 55 60 
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Ser Val Lys lie Pro Pro Thr lie Arg His Lys Val Gly Phe Met ser 
65 70 75 80 

Leu Eis Leu Leu 



<2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 84 amino acids 
(B> TYPE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) AKTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 

{Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Thr Pro lie val Gin Phe Gin Gly Glu Ser Asn Cys Leu Lys Cys Phe 
1 5 10 15 



Arg Tyr Arg Leu Asn Arg Asp His Arg Bis Leu Phe Asp Leu lie Ser 
20 25 30 

Ser Thr Trp His Trp Ala Ser Ser Lys Ala Pro His Lys His Ala lie 
35 40 45 

Val Thr Val Thr Tyr Asp Ser Glu Glu Gin Arg Gin Gin Phe Leu Asp 
50 55 60 

Val val Lys He Pro Pro Thr He Ser His Lys Leu Gly Phe Met Ser 
65 70 75 80 

Leu His Leu Leu 



(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

<iv) ANTI-SENSE: NO 
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(V) FRAGMENT TYPE: internal 

(xi) SEQXraNCE DESCRIPTION: SEQ ID NO: 82: 

Thr Pro lie lie His Leu Lys Gly Asp Arg Asn Ser Leu Lys cys Leu 
15 10 15 

Arg Tyr Arg Leu Arg Lys His Ser Asp His Tyr Arg Asp lie Ser Ser 
2-0 25 30 

Thr Trp His Trp Thr Gly Ala Gly Asn Glu Lys Thr Gly lie Leu Thr 
35 40 45 

CI Val Thr Tyr His ser Glu Thr Gin Arg Thr Lys Phe Leu Asn Thr val 

m 50 55 60 

ii Ala lie Pro Asp Ser Val Gin lie Leu Val Gly Tyr Asn Thr Met Tyr 

^=1 65 70 75 80 

w 

T« (2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

Jjt (A) LENGTH: 80 amino acids 

ijt (B) TYPE: amino acid 

^jf (D) TOPOLOGY: linear 

^ (ii) HOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v> FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: 

Thr Pro lie Val His Leu Lys Gly Asp Ala Asn Thr Leu Lys Cys Leu 
1 5 10 15 

Arg Tyr Arg Phe Lys Lys His cys Thr Leu Tyr Thr Ala Val Ser Ser 
20 25 30 

Thr Trp His Trp Thr Gly His Asn Tyr Lys His Lys Ser Ala lie val 
35 40 45 

Thr Leu Thr Tyr Asp Ser Glu Trp Gin Arg Asp Gin Phe Leu Ser Gin 
50 55 60 

Val Lys lie Pro Lys Thr He Thr Val ser Thr Gly Phe Met ser He 
65 70 75 80 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 
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(B) TYPE: amino acid 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEO ID NO: 84: 

Ala Pro lie Val His Leu Lys Gly Glu Ser Asn Ser Leu Lys Cya Leu 
15 10 15 

Arg Tyr Arg Leu Lys Pro Tyr Asn Glu Leu Tyr Ser ser Met Ser Ser 
20 25 30 

Thr Trp His Trp Thr ser Asp Asn Lys Asn ser Lys Asn Gly He Val 
35" 40 45 

Thr val Thr Phe Val Thr Gly Gin Gin Gin Gin Met Phe Leu Gly Thr 
50 55 60 

val Lys He Pro Pro Thr Val Gin He Ser Thr Gly Phe Met Thr Leu 
65 70 75 80 

val 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

<iv> ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Gly He val Glu Gin cys Cys Thr Ser He Cys ser Leu 0?yr Gin Leu 
1 5 10 15 



Glu Asn Tyr cys Asn 
20 



(2) INFORMATION FOR SEQ ID NO: 86: 



wo 96/17956 



PCT/DS95yi5944 



106 



<i) SEQUENCE CHARACTERISTICS T 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) hypothetical: no 

(iv> ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:86: 

Phe Val Asn Gin His Leu Cys Gly ser His Leu Val Glu Ala Leu Tyr 
1 5 10 15 

Leu Val cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr 
20 25 30 



{2) INFORMATION FOR SEQ ID NO: 87: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Gly lie Val Glu Gin Cys cys Ala Ser val Cys ser Leu Tyr Gin Leu 
1 5 10 15 

Glu Asn Tyr Cys Asn 
20 



(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: NO 
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(iv) anti-sense: ko 

(V) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Phe Val Asn Gin Bis Leu Cys Gly ser Bis Leu Val Glu Ala Leu Tyr 
1 5 10 15 

Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr 
20 25 30 

(2) INFORMATION FOR SEQ ID NOi89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) topology: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

<iV) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 
(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Gin Leu Tyr ser Ala Leu Ala Asn Lys Cys Cys His Val Gly Cys lie 
1 5 10 15 

Lys Arg ser Leu Ala Arg Phe Cys 
20 

(2) INFORMATION FOR SEQ ID NO:90: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 33 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linecir 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
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Asp Ser Trp Met Glu Glu Val He Lys He Cys Gly Arg Glu Leu val 

15 10 15 

Arg Ala Gin He Ala lie cys Gly Met ser Thr Trp Ser Lys Arg Ser 
20 25 30 



Leu 



(2) INFORMATION FOR SEQ XD NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Glu Glu Lys Met Gly Thr Ala Lys Lys cys cys Ala He Gly cys Ser 
1 5 10 15 

Thr Glu Asp Phe Arg Met Val cys 
20 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 
(B> TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL? NO 
<iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Arg Pro Asn Trp Glu Glu Arg Ser Arg Leu Cys Gly Arg Asp Leu He 
1 5 10 15 



Arg Ala Phe He Tyr Leu Cys Gly Gly Thr Arg Trp Thr Arg Leu Pro 
20 25 30 
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Asn Phe Gly Asn Tyr Pro He Met 
35 40 




(2) INFORMATION FOR SEQ ID NO: 93; 

(i) SEQtJENCE CHARACTERISTICS: 

(A> IiENGTH: 182 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

<iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Ser Gly He Val Pro Thr Leu Gin Asn He Val Ser Thr Val Asn Leu 
15 10 15 

Asp cys Lys Leu Asp Leu Lys Ala He Ala Leu Gin Ala Arg Asn Ala 



Glu Tyr Asn Pro Lys Arg Phe Ala Ala Val He Met Arg He Arg Glu 
35 40 45 

Pro Lys Thr Thr Ala Leu He Phe Ala Ser Gly Lys Met Val cys Thr 
50 55 60 

Gly Ala Lys ser Glu Asp Phe Ser Lys Met Ala Ala Arg Lys Tyr Ala 
65 70 75 80 

Arg He Val Gin Lys Leu Gly Phe Pro Ala Lys Phe Lys Asp Phe Lys 
85 90 95 

He Gin Asn He Val Gly Ser Cys Asp val Lys Phe Pro He Arg Leu 



Glu Gly Leu Ala Tyr ser His Ala Ala Phe Ser Ser Tyr Glu Pro Glu 
115 120 125 

Leu Phe Pro Gly Leu He Tyr Arg Met Lys val Pro Lys He Val Leu 
130 135 140 

Leu He Phe Val Ser Gly Lys He Val He Thr Gly Ala Lys Met Arg 
145 150 155 160 

Asp Glu Thr Tyr Lys Ala Phe Glu Asn He Tyr Pro val Leu Ser Glu 
165 170 175 

Phe Arg Lys He Gin Gin 



100 



105 



110 



180 
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(2) IKF0HM21TI0K FOR SEQ ID NO:94: 
(i) SKQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

<v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Asn Ser Asn ser Thr Pro lie val His Leu Lys Gly Asp Ala Asn Thr 
1 5 10 15 

Leu Lys Cys Leu Arg Tyr Arg Phe Lys Lys His Cys Thr Leu Tyr Thr 
20 25 30 

Ala val ser ser Thr Trp His Trp Thr Gly His Asn Val Lys His Lys 
35 40 45 

Ser Ala lie Val Thr Leu Thr Tyr Asp Ser Glu Trp Gin Arg Asp Gin 
50 55 60 

Phe Leu Ser Gin val Lys lie Pro Lys Thr lie Thr Val ser Thr Gly 
65 70 75 80 

Phe Met Ser lie 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

<iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 



Asn Ser Asn Thr Thr Pro lie Val His Leu Lys Gly Asp Ala Asn Thr 
1 5 10 15 
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Leu Lys Cys Leu ftxg Tyr Arg Phe Lys Lys Bis Cys Thr Leu Tyr Thr 

20 25 30 

Ala vaX ser ser Thr Trp His Trp Thr Gly His Asn Val Lys His Lys 
35 40 45 

Ser Ala lie val Thr Leu Thr Tyr Asp ser Glu Trp Gin Arg Asp Gin 
50 55 60 

Phe Leu ser Gin Val Lys He Pro Lys Thr He Thr Val Ser Thr Gly 
65 70 75 80 

Phe Met ser He 



(2) INFORMATION FOR SEQ ID NO:96t 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino -acids 
<B) TYPE: amino acid 
(D> TOPOLOGY: linear 

(ii> MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Ser Gly Asn Thr Thr Pro He He His Leu Lys Gly Asp Arg Asn Ser 
15 10 15 

Leu Lys cys Leu Arg Tyr Arg Leu Arg Lys His Ser Asp His Tyr Arg 
20 25 30 

Asp He Ser Ser Thr Trp His Trp Thr Gly Ala Gly Asn Glu Lys Thr 
35 40 45 

Gly He Leu Thr Val Thr Tyr His Ser Glu Thr Gin Arg Thr Lys Phe 
50 55 60 

Leu Asn Thr Val Ala He Pro Asp Ser Val Gin He Leu Val Gly Tyr 
65 70 75 80 

Met Thr Met 



(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQXJSNCE CHARACTERISTICS: 

(A) LENGTH: 84 axciino acids 

(B) TYPE J amino acid 
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{D> TOPOLOGY: linear 
(ii) MOLECUI.E TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv> ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQt^ENCE DESCRIPTION: SEQ ID NOs97t 

ser Gly Asn Thr Ala Pro lie Val His Leu Lys Gly Glu Ser Asn Ser 
1 5 10 15 

Leu Lys cys Leu Arg Tyr Arg Leu Lys Pro Tyr Lys Glu Leu Tyr ser 
20 25 30 



ser Met ser Ser Thr Trp His Trp Thr ser Asp Asn Lys Asn Ser Lys 
35 40 45 

Asn Gly lie Val Thr Val Thr Phe Val Thr Glu Gin Gin Gin Gin Met 
50 55 60 

Phe Leu Gly Thr val Lys lie Pro Pro Thr val Gin lie Ser Thr Gly 
65 70 75 80 

Phe Met Thr Leu 



(2) INFORMATION FOR SEQ ID KO:98: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTIOII : SEQ ID NO:98: 

ser Gly Asn Thr Ser Cys Phe Ala Leu He Ser Gly Thr Ala Asn Gin 
1 5 10 15 

Val Lys Cys Tyr Arg Phe Arg Val Lys Lys Asn His Arg His Arg Tyr 
20 25 30 

Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala Asp Asn Gly Ala Glu 

35 40 45 



wo 96/17956 




PCT/US95/15944 



113 



Arg Gin Gly Gin Ala Gin lie Leu lie Thr Phe Gly Ser Pro Ser Gin 
50 55 60 

Arg Gin Asp Phe Leu Lys His val Pro Leu Pro Pro Gly Met Asn He 
65 70 75 80 

ser Gly Phe Thr Ala ser Leu Asp Phe 
85 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQXJENCE CHARACTERISTICS J 
<A) LENGTH: 7 amino acids 
(B) TVPEt amino acid 
<D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: C-terminal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

ser Asn Lys Lys Thr Thr Ala 
1 5 



(2) INFORMATION FOR SEQ ID NOtlOO: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
Asn Ser Asn Thr 



(2) INFORMATION FOR SEQ ID NO:101: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 4 amino acids 
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(B) TYPE: amino acid 

(D) TOPor^Y: linear 
(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: KO 
(iv) ANTI -SENSE: »0 

(v) FRAGMENT TYPE: internal 
(Xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 101: 

Ser Gly Asn Thr 

1 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) hypothetical: no 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

ser ser Gly ser ser Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

Cys Tyr Pro Glu lie Lys Asp Lys Glu Glu val Gin Arg Lys Arg 



.1 



5 



10 



15 
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(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CH31RACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

<iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

Met Glu Gin Arg He Thr Leu Lys Asp Tyr Ala Met Arg Phe Gly Gin 
15 10 15 

Thr Lys Thr Ala Lys Asp Leu Gly Val Tyr Gin Ser Ala He Asn Lys 
20 25 30 



Ala He His Ala Gly Arg Lys He Phe Leu Thr He Asn Ala Asp Gly 
35 40 45 

ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pro Ser Asn Lys Lys Thr 
50 55 60 

Thr Ala 
65 



(2) INFORMATION FOR SEQ ID NO: 105: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

<v) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 



Met Glu Gin Glu He Thr Leu Lys Asp Tyr Ala Met Arg Phe Gly Gin 
1 5 10 15 
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Thr Lys Thr Ala Lys Asp Leu Gly val Tyr Gin ser Ala He Asn Lys 
2a 25 30 

Ala He His Ala Gly Arg Lys He Phe Leu a?hr He Asn Ala Asp Gly 
35 40 45 

Ser val Tyr Ala Glu Glu Val Lys Pro Phe Pro Ser Asn Lys Lys Thr 
50 55 60 

Thr Ala 
65 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS I 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) hypothetical: no 

(iv) ANTI-SENSE s NO 

(V) FRAGMENT TYPE : N-terminal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

Met Arg Gin Arg He Thr Leu Lys Asp Tyr Ala Met Arg Phe Gly Gin 
15 10 15 

Thr Lys Thr Ala Lys Asp Leu Gly Val Tyr Gin Ser Ala He Asn Lys 
20 25 30 

Ala lie His Ala Gly Arg Lys He Phe Leu Thr He Asn Ala Asp Gly 
35 40 45 

Ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pro Ser Asn Lys Lys Thr 
50 55 60 

Thr Ala 
65 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 a m ino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 
(xi) SBOtJENCE DESCRIPTION: SBQ ID NO: 107: 

Ser Thr Lys Lys Lys Pro Leu Thr Gin Glu Gin Leu Glu Asp Ala Arg 
1 5 10 15 

Arg Leu Lys Ala lie Tyr Glu Lys Lys Lys Asn Glu Leu Gly Leu Ser 
20 25 30 

Gin Glu ser Val Ala Asp Lys Met Gly Met Gly Gin Ser Gly Val Gly 
35 40 45 

Ala Leu Phe Asn Gly lie Asn Ala Leu Asn Ala Tyr Asn Ala Ala Leu 
50 55 60 

Leu Ala Lys He Leu Lys Val Ser Val Glu Glu Phe Ser Pro ser He 
65 70 75 80 

Ala Arg Glu He Tyr Glu Met Tyr Glu Ala Val Ser Met Glu Pro Ser 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 
{B> TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

Ser Thr Lys Lys Lys Pro Leu Thr Gin Glu Gin Leu Glu Asp Ala Arg 

1 5 10 15 

Arg Leu Lys Ala He Tyr Glu Lys Lys Lys Asn Glu Leu Gly Leu Ser 
20 25 30 

Gin Glu Ser Val Ala Asp Lys Met Gly Met Gly Gin ser Gly Val Gly 
35 40 45 

Ala Leu Phe Asn Gly He Asn Ala Leu Asn Ala Tyr Asn Ala Ala Leu 

50 .55 60 

Leu Ala Lys He Leu Lys Val Ser Val Glu Glu Phe ser Pro Ser He 
65 70 75 80 
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jaa Arg Glu lie Tyr Glu Met Cys Glu Ala Val Ser Met Glu Pro Ser 
85 90 95 



(2) INFORMATION FOR SEQ ID NO:109: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOI*ECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

Gly He Val Glu Gin Cys Cys Thr Ser He Cys ser Leu Tyr Gin Leu 
15 10 15 

Glu Asn Tyr Cys As n Mat Ser Met Glu Gin Arg He Thr Leu Lys Asp 
20 25 30 

Tyr Ala Met Arg Phe Gly Gin Thr Lys Thr Ala Lys Asp Leu Gly Val 
35 40 45 

Tyr Gin Ser Ala He Asn Lys Ala He His Ala Gly Arg Lys He Phe 
50 55 60 

Leu Thr He Asn Ala Asp Gly ser Val Tyr Ala Glu Glu Val Lys Pro 
65 70 75 80 

Phe Pro ser Asn Lys Lys Thr Thr Ala ser Asn Lys Lys Thr Thr Ala 
85 90 95 

Asn Ser Asn Thr Thr Pro He val His Leu Lys Gly Asp Ala Asn Thr 
100 105 110 

Leu Lys cys Leu Arg Tyr Arg Phe Lys Lys His Cys Thr Leu Tyr Thr 
115 120 125 

Ala val ser Ser Thr Trp His Trp Thr Gly His Asn Val Lys His Lys 
130 135 140 

ser Ala He val Thr Leu Thr Tyr Asp Ser Glu Trp Gin Arg Asp Gin 
145 150 155 160 

Phe Leu Ser Gin Val Lys He Pro Lys Thr He Thr Val Ser Thr Gly 
165 170 175 

Phe Met Ser He 
180 
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(2) INFO^TION TOR SZQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS s 
1(A) LENGTH: 113 amino acids 
1(B) TYPE: amino acid 
Vd) TOPOXiOGYt linear 

(ii) MOLECULE TYPE: protein 

<iii) HYFOTHBTICALi NO 
(iv) AHTI-SENSEs NO 

(V) FRAGMENT TYPE: N-terminal 
(xi) SECUENCE DESCRIPTION: SEQ ID NOj 

Gly lie val Glu Gin cys Cys Thr ser 

1 I 5 

Glu AsnjTyf Cys Asn Met Ser Met Glu 

20 25 



Tyr Ala 



Met Arg Phe Gly Gin Thr Lys 
35 40 

ser Ala He Asn l.ys Ala He 
55. 

He Asn Ala Asp Gly ser Val 
70 



Tyr Gin 
50 

Leu Thr 
65 

Phe Pro Iser Asn Lys Lys Thr Thr Ala 

I 85 

Cys Asp Thr Asp Asp Arg His Arg He 
XOO 105 

Thr 



:110: 

lie cys ser Leu Tyr Gin Leu 
10 15 

Gin Arg He Thr Leu Lys Asp 

30 

Thr Ala Lys Asp Leu Gly Val 
45 

His Ala Gly Arg Lys He Phe 
60 

Tyr Ala Glu Glu Val Lys Pro 
75 80 

ser Asn Lys Lys Thr Thr Ala 
90 95 

Glu Glu Lys Arg Lys Arg Lys 

110 



(2) INFORMATION FOR SEQ ID NO: 111: 

I 
f 

(i) SEOtJsbcE CHARACTERISTICS: 

(A) LENGTH: 292 amino acids 

(B) |typE: amino acid 
(0) boPOLOGY: linear 



(ii> MOLECjOLE TYPE: protein 
(iii) HYPOT^TICAL: NO 
(iv) ANTI-ISENSEs NO 

i 

(v) FRAGMENT TYPE: N-terminal 
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(Xi) SEQXJENCE DESCRIPTIOK: SEQ ID NO: 111: 

Pbe Vai Asn Gin His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr 

1 Is 10 15 

Leu Val cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Met Ser 
20 25 30 

Met Glu Gin Glu He Thr Leu Lya Asp Tyr Ala Met Arg Phe Gly Gin 
35 I 40- 45 

Thr Lys Thr jftla Lys Asp Leu Gly Val Tyr Gin ser Ala He Asn Lys 
50 ! 55 60 



Ala He His jAla Gly Arg Lys He Phe Leu Thr He Asn Ala Asp Gly 
65 ! 70 75 BO 



Ser Val Tyr jAla Glu Glu Val Lys Pro Phe Pro ser Asn Lys Lys Thr 
[85 90 95 



Thr Ala ser' 



Asn Lys Lys Thr Thr Ala ser ser Gly ser ser Gly Ser 
100 105 110 



Gly He Val Pro Gin Leu Gin Asn He Val Ser Thr Val Asn Leu Gly 
115 120 125 



Cys Lys Leii Asp Leu Lys Thr He Ala Leu Arg Ala Arg Asn Ala Glu 
130 135 140 

Tyr Asn Pro Lys Arg Phe Ala Ala Val He Met Arg He Arg Glu Pro 

145 150 155 160 



Arg Thr Thr Ala Leu He Phe Ser Ser Gly Lys Met Val Cys Thr Gly 
165 170 175 

Ala Lys Ser Glu Glu Gin Ser Arg Leu Ala Ala Arg Lys Tyr Ala Arg 
180 185 190 

Val Val Gin Lys Leu Gly Phe Pro Ala Lys Phe Leu Asp Phe Lys He 
195 200 205 

Gin Asn Met Val Gly Ser cys Asp Val Lys Phe Pro He Arg Leu Glu 

210 i 215 220 

1 

Gly Leu Val Leu Thr His Gin Gin Phe ser ser Tyr Glu Pro Glu Leu 
225 230 235 240 

Phe Pro Gly Leu He Tyr Arg Met He Lys Pro Arg He Val Leu Leu 
245 250 255 



He Phe Val ser Gly Lys Val Val Leu Thr Gly Ala Lys Val Arg Ala 
V 260 265 270 



Glu He Tyr Glu Ala Phe Glu Asn He Tyr Pro He Leu Lys Gly Phe 
27,5 280 285 
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Arg Lys Thr Thr 
290 



(2) lOTORMATIOiJ FOR SEQ 10 NO: 112: 
(i) SEQUESIck CHARACTERISTICS: 



(A) 
(B) 



[.ENGTHj 273 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 



(ii) MOLECmE TYPE: protein 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
<v) FRAGMENT TYPiE: N -terminal * 

(Xi) SEQUENCE DESCRIPTION: SEO ID NO: 112: 

Phe val Asn Gin His Leu Cys Gly ser His Leu val Glu Ala Leu Tyr 

1 I 5 10 15 

Leu Val ^s Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Met Ser 
20 25 30 

Met Arg ^Gln Arg He Thr Leu Lys Asp Tyr Ala Met Arg Phe Gly Gin 

135 40 45 

Thr Lys J Thr Ala Lys Asp Leu Gly Val Tyr Gin Ser Ala Xle Asn Lys 
50 ^ 55 60 

Ala He' His Ala Gly Arg Lys Xle Phe Leu Thr He Asn Ala Asp Gly 
65 70 75 80 

Ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pro Ser Asn Lys Lys Thr 



85 



90 



95 



Thr Ala ser Asn Lys Lys Thr Thr Ala Gly Asp Pro Gly Lys Lys Lys 
100 105 110 

Gin His : He Cys His He Gin Gly cys Gly Lys Val Tyr Gly Lys Thr 
\ 115 120 125 

ser His Leu Arg Ala His Leu Arg Trp His Thr Gly Glu Arg Pro Phe 
130 135 140 

Met cys Thr Trp ser Tyr cys Gly Lys Arg Phe Thr Arg ser Asp Glu 
145 150 155 160 

Leu Gin Arg His Lys Arg Thr His Thr Gly Glu Lys Lys Phe Ala cys 
165 170 175 

Pro Glu C^-B Pro Lys Arg Phe Met Arg Ser Asp His Leu Ser Lys His 
\ 180 185 190 
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He Lys Thr 

195 

Val Gly Thr 
210 

Thr Ala Thr 
225 

Ala lie Cys 
Val Met Gin 
Phe 



His Gin Asn Lys Lys Gly Gly Pro Gly Val Ala Leu Ser 
200 205 

Leu Pro Leu Asp Ser Gly Ala Gly Ser Glu Gly ser Gly 
215 220 

Fro ser Ala Leu lie Thr Thr Asn Met Val Ala Met Glu 
230 235 240 

Pro Glu Gly He Ala Arg Leu Ala Asn Ser Gly He Asn 
245 250 255 

Val Ala Asp Leu Gin ser He Asn He Ser Gly Asn Gly 

260 265 270 



(2) INFORMATION TOR SEQ ID NO: 113: 
(i) SEQUENCS CHARACTERISTICS: 

(A) LENGTH: 421 amino acids 
<B) TYPE: amino acid 
(D) TO^boLOGY: linear 

^ (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v> FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

Gin Leu Tyr^ Ser Ala Leu Ala Asn Lys 
1 5 

Lys Arg ser Leu Ala Arg Phe Cys Met 
20 25 

Leu Lys Asp Tyr Ala Met Arg Phe Gly 

35 \ 40 

Leu Gly Val *?yr Gin Ser Ala He Asn 
50 55 

Lys He Phe Leu Thr He Asn Ala Asp 
65 70 

\ 

Val Lys Pro Pii^ Pro Ser Asn Lys Lys 

\ 85 

Thr Thr Ala Met Ala Asp Asp Asp Pro 
100 105 



113: 

Cys cys His Val Gly Cys He 
10 15 

Ser Met Arg Gin Arg lie Thr 
30 

Gin Thr Lys Thr Ala Lys Asp 
45 

Lys Ala He His Ala Gly Arg 
60 

Gly Ser val Tyr Ala Glu Glu 
75 80 

Thr Thr Ala Ser Asn Lys Lys 
90 95 

Tyr Gly Thr Gly Gin Met Phe 
110 
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His LeulAsn Thr Ala Leu Thr His Ser lie Phe Asn Ala Glu Leu Tyr 

115 120 125 



m 
%i = 



Ser Pro 
130 

Glu Gin 
145 

Pro ser 



ser Tyr 



lie Val 



ser Lep 
210 

Gly Pro 
225 

Val Thr 



Ala Cys 



Glu lie Pro Leu ser Thr Asp Gly Pro Tyr Leu Gin lie Leu 
135 140 

Pro Lys Gin Arg Gly Phe Arg Phe Arg Tyr Val cys Glu Gly 
150 155 160 

His Gly Gly Leu Pro Gly Ala ser ser Glu Lys Asn Lys Lys 
165 170 175 

Pro Gin Val Lys lie Cys Asn Tyr Val Gly Pro Ala Lys val 
180 185 190 

Gin Leu val Thr Asn Gly Lys Asn lie His Leu His Ala His 
195 26X) 205 

val Gly Lys His cys Glu Asp Gly val cys Thr Val Thr Ala 
215 220 

Lys Asp Met val Val Gly Phe Ala Asn Leu Gly lie Leu His 
230 235 240 

Lys Lys Lys Val Phe Glu Thr Leu Glu Ala Arg Met Thr Glu 
245 250 255 

He Arg Gly Tyr Asn Pro Gly Leu Leu val His Ser Asp Leu 
260 265 270 



Ala Tyr- Leu Gin Ala Glu Gly Gly Gly Asp Arg Gin Leu Thr Asp Arg 
i 275 280 285 

Glu Lys^ Glu He lie Arg Gin Ala Ala Val Gin Gin Thr Lys Glu Met 
290j 295 300 

Asp Leul ser Val Val Arg Leu Met Phe Thr Ala Phe Leu Pro Asp ser 
305 310 315 320 

Thr Glyi Ser Phe Thr Arg Arg Leu Glu Pro Val Val Ser Asp Ala He 
325 330 335 

Tyr Asp^ Ser Lys Ala Pro Asn Ala Ser Asn Leu Lys He Val Arg Met 
340 345 350 

Asp Argi Thr Ala Gly cys val Thr Gly Gly Glu Glu He Tyr Leu Leu 
: 355 360 365 

Cys Asp Lys Val Gin Lys Asp Asp He Gin He Arg Phe Tyr Glu Glu 
370; 375 380 



Glu Glu JiBn Gly Gly Val Trp Glu Gly Phe Gly Asp Phe ser Pro Thr 
385 390 395 400 



Asp Val kLs Arg Gin Phe Ala He Val Phe Lys Thr Pro Lys Tyr Lys 
405 410 415 
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Asp val 

420 




(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) nENGTH: 391 amino acids 
<B) TOPE: amino acid 
(D) TOPOLOGY: linear 

(ii> HOLECuAe TYPE: protein 

(iii> HYPOTHETICAL: NO 

i 

(iv) ANTI-SE^SE: NO 
(v) FRAGMEN^ TYPE: N-terminal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

j 

Met Arg Gin ^Arg He Thr Leu Lys Asp Tyr Ala Met Arg Phe Gly Gin 

1 ; 5 10 15 

Thr Lys Thr kla Lys Asp Leu Gly Val Tyr Gin Ser Ala He Asn Lys 
^0 25 30 

i 

Ala He His Ala Gly Arg Lys He Phe Leu Thr He Asn Ala Asp Gly 
35 1 40 45 

ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pro Ser Asn Lys Lys Thr 
50 55 60 

Thr Ala Met ilia Glu Asp Asp Pro Tyr Leu Gly Arg Pro Glu Gin Met 
65 70 75 80 

Phe His Leu ksp Pro Ser Leu Thr His Thr He Phe Asn Pro Glu Val 
85 90 95 

Phe Gin Pro ciln Met Ala Leu Pro Thr Ala Asp Gly Pro Tyr Leu Gin 
too 105 110 

He Leu Glu Gin Pro Lys Gin Arg Gly Phe Arg Phe Arg Tyr Val Cys 
115 120 125 

Glu Gly Pro ser His Gly Gly Leu Pro Gly Ala Ser ser Glu Lys Asn 



130 



135 



140 



Lys Lys ser Tyr Pro Gin Val Lys He Cys Asn Tyr Val Gly Pro Ala 
145 150 155 160 

Lys val He Val Gin Leu Val Thr Asn Gly Lys Asn He His Leu His 

\ 165 170 175 

\ 
t 

Ala His Ser Leu \ Val Gly Lys His cys Glu Asp Gly He Cys Thr Val 

180 \ 185 190 
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Thr Ala Gly Pro Glu Asp Cys Val His Gly Phe Ala Asn Leu Gly lie 
lis 200 205 



Leu His vil Thr Lys Lys Lys Val Phe Glu Thr Leu Glu Ala Arg Met 
210 1 215 220 

Thr Glu Alk Cys lie Arg Gly Tyr Asn Pro Gly Leu Leu val His Pro 



225 



230 



235 



240 



Asp Leu Ali Tyr Leu Gin Ala Glu Gly Gly Gly Asp Arg Gin Leu Gly 
245 250 255 



Asp Arg Glv| Lys Glu Leu lie Arg Gin Ala Ala Leu Gin Gin Thr Lys 
260 265 270 



Glu Met Asp 
275 

Asp ser Thr 
290 

Ala lie Tyr 
305 

Arg Met Asp 



Leu Leu Cys 



Glu Glu Glu 

355 

Pro Thr Asp 
370 

Tyr Lys Asp 
385 



Leu ser Val Val Arg Leu Met Phe Thr Ala Phe Leu Pro 
280 285 

Gly Ser Phe Thr Arg Arg Leu Glu Pro Val Val Ser Asp 
295 300 

Asp Ser Lys Ala Pro Asn Ala Ser Asn Leu Lys lie Val 
310 315 320 

Arg Thr Ala Gly cys Val Thr Gly Gly Glu Glu He Tyr 
325 330 335 

Asp Lys Val Gin Lys Asp Asp lie Gin He Arg Phe Tyr 
340 345 350 

Glu Asn Gly Gly Val Trp Glu Gly Phe Gly Asp Phe ser 
360 365 

Val His Arg Gin Phe Ala He Val Phe Lys Thr Pro Lys 
375 380 

He Asn Xle Thr 
390 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LEl^GTH: 391 amino acids 
<B) TYlfe: amino acid 
(D) TOCOLOGY: linear 

{ii> MOLECtJLE; TYPE: protein 

< iii ) HYPOTHETICAL : NO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 
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(Xi) SEQ^NCE DESCRIPTION: SEQ ID NO: 115: 



Met Glu 

1 

Thr iys 
Ala lie 



ser Val 
50 



Gin Glu He Thr Leu Lys Asp Tyr Ala Met Arg Phe Gly Gin 
5 10 15 

Thr Ala Lys Asp Leu Gly val Tyr Gin ser Ala He Asn Lys 
20 25 30 

His Ala Gly Arg Lys He Phe Leu Thr He Asn Ala Asp Gly 
35 40 45 

Tyr Ala Glu Glu val Lys Pro Phe Pro ser Asn Lys Lys Thr 
55 60 



70 



75 



80 



Thr Ala iMet Ala Glu Asp Asp Pro Tyr Leu Gly Arg Pro Glu Gin Met 
65 

Phe His 
Phe Gin 
lie Leu 



Leu Asp pro Ser Leu Thr His Thr He Phe Asn Pro Glu val 
85 90 95 

Pro Gin Met Ala Leu Pro Thr Ala Asp Gly Pro Tyr Leu Gin 
100 105 110 

Glu Gin Pro Lys Gin Arg Gly Phe Arg Phe Arg Tyr val cys 
115 120 125 



Glu Gly p»ro Ser His Gly Gly Leu Pro Gly Ala Ser Ser Glu Lys Asn 
130 i 135 140 

Lys Lys Ser Tyr Pro Gin Val Lys He Cys Asn Tyr val Gly Pro Ala 
145 \ 150 155 160 

Lys Val lie Val Gin Leu Val Thr Asn Gly Lys Asn He His Leu His 
165 170 175 

Ala His ^er Leu val Gly Lys His Cys Glu Asp Gly He Cys Thr Val 
180 185 190 

Thr Ala Qly Pro Glu Asp Cys Val His Gly Phe Ala Asn Leu Gly lie 
1^5 200 205 

i 

Leu His Val Thr Lys Lys Lys Val Phe Glu Thr Leu Glu Ala Arg Met 
210 215 220 

Thr Glu Ala Cys He Arg Gly Tyr Asn Pro Gly Leu Leu Val His Pro 
225 ■ 230 235 240 

Asp Leu Ala Tyr Leu Gin Ala Glu Gly Gly Gly Asp Arg Gin Leu Gly 
\ 245 250 255 

\ ■ ■ 

Asp Arg Glti Lys Glu Leu He Arg Gin Ala Ala Leu Gin Gin Thr Lys 
j 260 265 270 

Glu Met iisj Leu Ser Val Val Arg Leu Met Phe Thr Ala Phe Leu Pro 
275 280 285 
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Asp se^ Thr Gly Ser Phe Thr Arg Arg Leu Glu Pro Val Val Ser Asp 
29d 295 300 



Ala He 
305 

Arg Met 



Leu Leu 



Glu Glu 



Tyr Asp ser Lys Ala Pro Asn Ala ser Asn Leu Lys lie Val 
310 315 320 

Asp Arg Thr Ala Gly cys Val Thr Gly Gly Glu Glu Xle Tyr 
325 330 335 

cys Asp Lys Val Gin Lys Asp Asp He Gin He Arg Phe Tyr 
340 345 350 

Glu Glu Asn Gly Gly Val Trp Glu Gly Phe Gly Asp Phe ser 
355 360 365 



Pro Thr 

370 i 



|Asp Val His Arg Gin Phe Ala He Val Phe Lys Thr Pro Lys 
375 380 



Tyr Lys Asp He Asn He Thr 

385 1 390 



(2) INFORMATION FOR SEQ ID NO: 116 j 
(i) SEQUENCE CHARACTERISTICS: 



<B) 



LENGTH: 241 amino acids 
TYPE: amino acid 



(D> i TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
<iii> HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
(V) FRAGl^NT TYPE: N-terminal 

(xi) SEOUskcE DESCRIPTION: SEQ ID NO: 116: 

I 

Met Arg Gin Arg He Thr Leu Lys Asp Tyr Ala Met Arg Phe Gly Gin 
1 i 5 10 15 

Thr Lys Thr Ala Lys Asp Leu Gly Val Tyr Gin Ser Ala He Asn Lys 
20 25 30 

Ala He HiSs Ala Gly Arg Lys He Phe Leu Thr He Asn Ala Asp Gly 
3^ 40 45 

Ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pro Ser Asn Lys Lys Thr 



50 



55 



60 



Thr Ala Asn Lys Lys Thr Thr Ala Gly Asp Pro Gly Lys Lys Lys 
65 70 75 80 

Gin His H^ cys His He Gin Gly cys Gly Lys Val Tyr Gly Lys Thr 
85 90 95 

/ 
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Ser 


His 


Leik 

1 


Arg Ala His 

100 


Leu Arg Trp His Thr Gly Glu Arg Pro Phe 

105 110 


Met 


Cys 


Thr 
115 


Trp Ser Tyr 


Cys Gly Lys Arg Phe Thr Arg Ser Asp Glu 
120 125 


Leu 


Gin 
130 


Arg 


His Lys Arg 


Thr His Thr Gly Glu Lys Lys Phe Ala Cys 
135 140 


Pro 
145 


Glu 


Cys 


Pro Lys Arg 
150 


Phe Met Arg Ser Asp His Leu Ser Lys His 
155 160 


lie 


Lys 


Thr 


His Gin ASH 
165 


Lys Lys Gly Gly Pro Gly Val Ala Leu Ser 
170 175 


val 


Gly 


Thjf 


Leu Pro Leu 
180 


Asp Ser Gly Ala Gly Ser Glu Gly Ser Gly 
185 190 


Thr 


Ala 


Tht- 
195 


Pro Ser Ala 


Leu lie Thr Thr Asn Met Val Ala Met Glu 
200 205 


Ala 


lie 
210 


cys 


Pro Glu Gly 


lie Ala Arg Leu Ala Asn Ser Gly lie Asn 
215 220 


Val 
225 


Met 


Glii 


Val Ala Asp 
230 


Leu Gin Ser lie Asn lie ser Gly Asn Gly 
235 240 



Phe 



<2> INFORMATION FOR SEO XD NO: 117: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) tiSNGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
(0) iTOPOLOGY: linear 

(ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETXCAL: SO 

(iv) ANTI-SESSE: NO 

{Xi> SEQ^NCE DESCRIPTION: SEQ ID NO; 117: 
GGGAMTNYCC 



(2) INFORMATION FOR SEQ 10 N0:118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: aznino acid 
(D) TOPOLOGY: linear 
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MOLECULE TYPE: protein 
HYPOT^TICAL: NO 
ANTI -SENSE: NO 
FRAGMENT TYPE: N-terrainal 



(ii) 
(iii) 
(iv> 

(V) 

(xi) SEQUENqE DESCRIPTION; SEQ ID NOillSs 



1 

Met Glu Proj val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser 

1 . : 5 10 15 

Gin Pro Lysj Thr Ala Cys Thr Asn Cys Tyr cys Lys Lys Cys Cys Phe 
20 25 30 

His cys GlnjVal Cys Phe He Thr Lys Ala Leu Gly He ser Tyr Gly 
35 ; 40 45 

i 

Arg Lys Lysj Arg Arg Gin Arg Arg Arg Ala His Gin Asn ser Gin Thr 

50 I 55 60 

His Gin Ala i Ser Leu ser Lys Gin 
65 ; 70 
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Claims 

What is claimed is: 

L A probe nudeic acid (PNA) comprising: 

(a) a singie-stranded sequence, 1/2 TBR, which is capable of fonning, under 
hybridimg conditions, a hybrid, TBR, with a 1/2 TBR present in a target nucleic 
acid(TPfA); 

(b) a singl< stranded sequence, 1/2 which is capable of fonning, under 
hybridizing conditions, a hybrid BBR, with about 0-10 1/2 BBR present in a 
booster nucleic acid (BNA); and 



(c) anOSA, 



which is no attached support and/or indicator, or an attadied siQ>port or 



other mtws of localization, including, but not limited to, attachm^ to beads, 

polymeri , and surfaces, and/or indicators; 
whmm said TBR is c^aple of binding with high afSnity to a TBA, said TBA being a substance 
capable of discriminating between a paired TBR and a TBR having unpaired nucleotides, and 
further, wherem said BBk is capable of binding with high affinity to a BBA, said BBA being a 
substance capable of discrirainating between a paired BBR and a BBR having unpaired nucleotides. 



2. A booster nucleic acid (BNA) con^rising: 

(a) a 1/2 BBR which has a sequ^ce which is complementary to a 1/2 BBR sequence 
in a PNA pr another BNA and which is capable of forming, unskr hybridizing 
condition^ hybrid, BBR, with the PNA; 

(b) an OSA/w licfa is no attached support or indicator or an attached support or other 



(c) 




means (^f] 
and: 



including but not limited to, attachment to beads, polymers, 
'or indicators; and 

:ation sites, 1/2 BBRs, for hybridization with additional BNAs; 



additional ny] 

wherein said BBR is capah e of binding with high aflSnity to a BBA, said BBA being a substance 
capable of discriminating between a paired BBR and a BBR having unpaired nucleotides. 



3. A Hairpin Nucidc Acid (HNA) cm^rising a single-stranded sequ^ce, 1/2 BBR, which 
under hybridizing conditio is is capable of forming a hairpin while at the same time binding to a 
BNA to fomi a BBR capabl 5 of binding a BBA, wherein said BBR is capable of binding with high 
a£5nity to a BBA, said BBA being a substance capable of discriminatii^ between a per&ct BBR and 
a BBR having unpaired nilcleotides. 



#1 
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4. HiePNAbf 
a niffileic acid bindin; ; 
RNA binding protein. 



^1 



daim 1 wherein the TBR is conq)rised of one or more recognition sites for 
protein, a DNA binding protein, a DNA-RNA hybrid binding protein or an 



5. IhePNA 
present in the genome 

organism \rfnch 



pfc 



claim^4wherein the TBR is a nucleic acid binding protein recognition site 
of a pathogen or is a binding site associated with a pathogenic condition in 
'^tdirate g^XHne orjis a nucleic acid binding protein recognition site present in the genome of an 
contajninates a fennentation process. 



6. ThePNA 



. )f claim 4 wherein the TBR is the HIV-LTR or a portion thereof 



(b) 



(c) 



7. A mdiiod i or detecting or localizing a specific TNA sequence, comprising the steps of: 
(a) hybri dizing said TNA with the PNA of claim i; 

hybr. dizing said PNA with a BNA containing a 1/2 BBR whose sequence is 
conq^Iementoy to a 1/2 BBR sequence in the PNA; 

addii^g the products of steps (a) and(b) containing a TBR and a BBR, to a sur&ce, 
liquii i or other medium containing a TBA; 

BAs to the mixture in step (c) wherein said BBA comprises: 
molecule or a portion of a molecule which is capable of selectively 
tg to a BBR; 
ible indicator; and 

prof^iced by the indicator attached to the BBA. 

8. Tlieme&oc crfdaiVT^hereinsaidiiidicat^^ 

catalyzhig reactions leading bi. production of colored reaction products; a radi<Hiuclide; colored 
beads. 

9. A m^odfc^ detecting the preseace in a sample of a specific Target Nucleic Acid, TNA, 
which comprises: 

(a) contacting said samph with a Probe Nucleic Acid, PNA, ^siiich, upon hybridizaticm 




with said TNA if present in said sample, forms a Target Binding Region, TBR, 
which ^ capable of binding a Target Binding Assembly, TBA; and 
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(b) contecting said sample, already in contact with said PNA, with a TBA capable of 
binding to aay TBRs fottaed by lie hybridization of said PNA and said TN A in the 
sample. 

10. A m^faod &r < i^ecting or localizing specific nucleic acid sequences with a high degree 
of sensitivity and specific^ which oanprises: 

(a) adding F|NAs containing a 1/2 BBR and a li^ TBR to a sample containing or 
suspectecl of containing TNAs containing 1/2 TBR sequences, to fom a complex 
having 'target binding regions, TBRs, formed by the hybridization of 
conq>leipentaiy 1/2 TBRs present in the PNAs and TNAs respectively; 

(b) bindmg jflie TBRs famed in Sep (a) to ai immobilized TBA to form a TBA-TNA- 
PNAo^lcx; 

(c) adding! Boosts Nucleic Acids, BNAs, containing boost» binding regions, 1/2 
BBRsJ to the complex fonned in step (b) sudi that the 1/2 BBRs in the BNAs 
hybiid|ze with the 1/2 BBR sequences piesait in the PNAs ot to 1/2 BBRs present 
in Bl«|As already bound to the PNA, to &xm BBRs, such that TBA-TNA-PNA- 

{BNX)n complexes are formed; 

(d) adding Hairpin Nucleic Acids, HNAs, containing 1/2 BBR sequences, to Ae 
^kx fonned instep (c) such that tl^ 1/2 BBRs in the HNAs hybridize with any 

avaimble 1/2 BBR sequences pr^ent in the BNAs of the complex of step (c), 
thereby capping the extension of the BNAs onto the TBA-TNA-PNA-(BNAX 
/ con^)lkcsofstcp{c)toformTBA-TNA-PNA-(BNAVHNAcompfe^^ 

(e) ' -ad3i ag^ooster Bmding Assemblies, BBAs, linked to indicator moieties, to the 

TBA-T1^A-PNA-(BNAVHNA ccHi5>lexes fonned in step (d) to fcMm TBA-TNA- 
PNA-(BNX-BBA)^-HNA complexes; and 

(f) detecting die signals produced by the indicator moieties linked to the TBAs, PNAs, 
BNAs, BBAs chtHNAs inthe TBA-TNA.PNA-(BNA-BBA)„-HNA complexes of 
step (e); 

wherein the TNA comprises: 

(i) J one or more specific 1/2 TBR nucleic acid sequences, the presence or 
absKice of which in a particular sample is to be confirmed; 

the PNA comprises:/ 
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(i) 



(ii) 



(iii) 



the BNA conpises: 
(i) 



(ii) 



(iii) 
av) 

the BBA comprises: 

(i) 



a sinHe-stranded sequence, 1/2 TBR, which is capable of forming, under 
l^rbridizing ccmditicras, a hybrid, TBR, with a 1/2 TBR present in a target 
nudeii! acid (TNA); 

a singli! stranded sequaice, 1/2 BBR, which is capable of fbrnmig, mider 
l^teid zing conditicns, a hybrid BBR with a 1/2 BBR presort in a booster 

nucleic acid (BNA); and 

an OS A, vtUch is no attached support and/or indicator, or an attached 
suppoit or other means of localization, including, but not limited to, 
attachi aent to beads, polymers, and surfaces, and/or indicators; 

a 1/2 IBBR, as shown in Figure l(IIb), whidi has a sequence which is 
coo?)lknentaiy to a 1/2 BBR sequence in a PNA and which is capable of 
forming, undCT hybridizing ccmditions, a hybrid, BBR, with the PNA; 
an O^A, which is no attached support or indictor or an attached stQtport 
or odler means of localization, including, but not limited to, attachoient to 

i 

bea^, polymers, and surfaces, and/or indicators; 

additional hybridization sites, 1/2 BBRs, for other BNAs; and 

seqi^irces, IFl BBRs, vAich can hybridize to BNAs already i^ridized to 

thetNA; 

a niolecule or a portion of a molecule which is capable of selectively 
bindmg to a BBR; and 

which is no attached support and/or indicator, or an attached 
suplpOTt or other means of localization, including, but not limited to, 
attadifflpnt to beads, polymers, and surfaces, and/or indicators; 



andtheTBAcon5)ri& 
(i) 

(ii) 



/ 




,a moleciite or a portion of a molecule which is capable of selectively 
bidding to a TBR; and 

nc^ attached support and/or indicator, or an attached support or other 
mkans of localization, including, but not limited to, attachment to beads, 
pMymers, and surfaces, and/or indicators. 



11. In a solid [phase hybridization mediod for detecting the presence of a target 
polynucleotide involving: immobilizing a target polynucleotide, if present in a test sample, directly 
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3 or via an intermediate captive structure, on a solid phase at a capture site; before, during or after said 

4 ionnobxUzatioQ, aching k detectable label to said target polynucleotide, if present; and detecting 

5 said label, if any, at said ca ptuiie site; the improvement comprising: 

6 (a) using a Tai get Binding Assembfy,TBA, as the means for achieving immobilize 

7 of said tai get polynucleotide, wherein said TBA binds only to a unique hybrid 

8 formed faei a specific Probe Nucleic Acid, PNA, and said target nucleic acid 

9 such that i perfect Target Binding Region, TBR, recognizable by said TBA is 

10 formed; and 

11 (b) including in the PNA a single stranded sequence, 1/2 BBR, capable of binding a 

12 Booster N^ddc Add, BNA, oontaintng a single stranded complementary 1/2 BBR 

13 v^da, tqjcjn hybridizatiai with the 1/2 BBR in the PNA, forms a BBR capable of 

14 binding l^led Booster Binding Assemblies, BBAs. 

/ I 

1 12. A target bindipg assembly, TBA, or a booster binding assembly, BBA, comprising at 

2 feast one nucleic acid recognition unit, and optionally one or all of the sequences selected from the 

3 group consisting of a linker sequence, an assembly sequence, an asymmetry sequence, a nuclear 

4 localization signal sequaite (NLS) and an OSA, 



1 13. The TBA of claim 12 wherein the nucleic acid recognition unit is selected from the 

2 group consisting of an >IF-kB binding unit, an SP 1 binding unit, a TATA binding unit, a human 

3 papiUonmvinisE2bin*Sik unit, an HPVLTR binding unit, an HIV LT^ 



1 
2 
3 
4 
5 
6 
7 



14. The^A 6f claim 13 wberein the mdeic acid recognition unit has the sequence 
sdected from the ^aap cxxish 



N0.66,SEQE>>JO. 



ag of SEQ ©NO. 63, SEQ IDNO. 64, SEQ ID NO. 65, SEQ E) 
)N0. 68. SEQ ED NO. 69, SEQ E) NO. 70, SEQ BONO. 71, SEQ 



m NO. 72, SEQ lb NO. i73, SEQ ID NO. 74, SEQ O NO. 75, SEQ ID NO. 76, SEQ ID NO. 77, 
SEQIDN0.78,SEQIDiN0.79,SEQIDNO. 80, SEQ IDNO. 81, SEQ IDNO. 82, SEQ ID NO. 
83,SEQlDN0.8j>,SE(^IDNO. 93, SEQ ED NO. 94, SEQ ID NO. 95, SEQ ID NO. 96, SEQ ID 
NO. 97, SEQ ID lljO. 9k and SEQ n> NO. 11 8. 



1 15. Hie TBA ,bf claim 12 w ^ein the linker sequmce is an oligopeptide vMch does not 

2 interfere with the nucl^c acid recognition function of the nucldc acid recognition unit and which 

3 provides stability and c^ttrol over tibe spacing of tbe nucleic acid recognition unit from the remainder 

4 offteTBA. 
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16. TheTBAof 
interdomain primary 



17. Tlis 
directs tl» fcdding and 



sequence 



TBAofdaim 



ckaim JJjH^ieidn the linker sequence is an oligopeptide sequence &oai the 
of a structural protem. 



association 



1 2 ^vdierein the assembly sequence is an oligopeptide sequence which 
of nucleic acid recognition units. 



18. The TBA of ckim ITjwterein the assembly sequence is derived from the bacteriophage 



lambda m? protein or the C 
SEQIDNO. 105, SEQID 



protein and is selected from the group consisting of SEQ ED NO. 104, 
NO. 106, SEQ ID NO. 107, and SEQ ID NO. 108. 



19. TheTBAof 
acid recognition and 



cli dm 12 vdieFcin the asymmetry sequoice directs the association of nucleic 
assem bly isequences in a predetermined order. 



20. TheTBAof 
gonadotropic hormcme, FSp, 



claimj9^erein the asymmetry sequence is derived from insulin, 
HCa LH, ACTa or relaxin. 



21. The TBA 
consisting of SEQ ID NO, 
SEQ ID NO. 90, SEQ 



of claim 20 wteein the asymmetry sequence is selected from tiK group 
L SEq'toNO- 86, SEQ ID NO. 87, SEQ ID NO. 88, SEQ ID NO, 89, 
ID NO. 9 1, and SEQ ID NO. 92. 



22. The TBA oC^lainl 12whcrein the NLS is an oHgop^tide which directs the migration 
and uptake of a protein/()r complex associated with said NLS into the nucleus of a cell. 



/ 



23. TheTBi^ 



ID NO. 72 and SEQ ID NO 



of claun ^1 wb 
10^. 



2 wherein the NLS is selected &om the group consisting of SEQ 



24. The TBA itf cl?im^2jES^di is HIV Detect I-IV or HPV Detect I-IV. 



25. The TBA of claim j2hamg a sequence selected from the groiq) consisting of SEQ ID 
NO. 109, SEQ ID NO. 1 10 SEQ ID NO. 1 1 1, SEQ ID NO. 1 12, SEQ ID NO. 1 13, SEQ ID NO. 
114, SEQ IDNO. 115, andfsEQ IDNO. 116. 



/ 
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26. A method of u^g the TBA of claim 12 to bind a particular nucleic acid sequence in 
a target nucleic acid sample which comprises: 

(a) fragmentii^ the nucleic acid in the target nucleic acid sample; 

(b) contacting, u ider hybridizing conditions, the fragmented nucleic acid with a probe 
nucleic acid complementary to the particular nucleic acid sequence of interest, 
wherein said probe nucleic acid, upon hybridization with said particular nucleic 
acid sequence of interest forms a target binding region to which said TBA 
specifically binds. 



27. Themethod/of 



26 wherein said probe nucleic acid, in addition to sequences 
compleoffintary to saidp^ci^Wcleic acid sequence of interest, also has additional seqraices to 

bind to form a booster binding site to which a labeled booster 
binding assembly can (bind o provide a signal showing and amplifying the binding of the probe 
nucleic acid to the target nuc|eic acid sequence of interest. 

28. A method of i^ing the TBA of claim 12 wherein said TBA is administered to patient 
in need of such treatment a Aerapeutically or prophylactically effective amount of said TBA, which 
conqmses administering the TBA, either in the form of a purified protein complex or in the form of 
a recombinant vector which, upon entry into the patient is able to express the TBA, sudi that the 
TBA binds the particular nucleic acid sequence to achieve the desired prophylactic or therapeutic 
result. 
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29. The method ofclaim 28 wherm said TBA is sdected from the ^oupoonsistm 
©NO. 109,SEQIDNO. ilO,SEQmNO. 111,SEQIDN0. 112, SEQIDNO. il3,SEQIDN0. 
1 14, SEQ YD NO. 1 15, and SEQ E) NO. 1 1 6, and the patient is infected with HIV or HPV. 

30. The method 3f claim 26 fiirther comprising the st^ of: 

(c) monitorikg the shift in mobility of nucleic adds in the target nucleic add sample 
as a fi^on of the size such that binding of the TBA to a particular fragment in 



thes 




les the mobility of the fragment. 



SLA diagnostikoV forensic test kit for the detection in a sample of nucleic acid having a 
specific sequence composition, which comprises: 
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3 (a) a first nu:leic acid probe complementary to nucleic acid with specific sequence 

4 compositicKi, the presence of which is to be ascertained in a test sanq)le, wherein said first nucleic 

5 acid probe and said nuc cic add with specific sequence conqyosition forming, upon hybridization, 

6 a binding site for a first n icleic arid bindmg protein, and wherein smd first nucleic acid probe fiirther 

7 comprises additional so* juence complementary to a second mideic acid probe; 

S (b) a first nuci Ac acid binding protein specific for the duplex formed by hybridization of 

9 said first nucleic acid p^be and said nucleic acid with specific sequence composition; 

10 (c) a second mKjleic acid probe complementary to said additional sequent 

1 1 nucleic acid probe, wherein, upon hybridization of said first and second nucleic acid probes, a 

12 binding site for a secoifd nucleic acid binding protein is formed; 

13 (d) a second Luclcic acid binding protein which binds specifically to the di^lex formed 

14 t^xm l^teiizatian of spid first nucleic acid probe and said second nucleic acid probe, wha:cin said 

15 second nucleic acid binding protein is labeled with a detectable label 
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32. The diifenostic or forensic test kit of claimjlwherein said first nucleic acid probe is 
complanentaiy to the HTV LTR, such that upon hybridization of said first nucleic acid probe with 
an HIV LTR, a bindmg site is formed for NF-kB or a subunit thereof SP 1 , TATA bmding protcm, 
HIV-Detect I, n, ni or IV, or HIV-Lock. 

33, The diagnostic or forensic test kit of claim 32 wherein said first nucleic acid binding 



protein is OTrk^<|r 
HIV-Lock/ \l 



a subunit thereof; SPl, TATA binding protein, HIV-Detect I, II, IE, or IV, or 




itic or forensic test kit of claim 33 whe rein said first nucleic add probe, in 
addWon to bdng cbn^lementaiy to the HTV LTR, comprises a sequence encoding the bacteriophage 
lambda left or ri|pt cfiprator and said seocaKl niKleic add probe comprises sequences con^lementaiy 
to said bacteriopjiage lambda left or right operator sequences in said first nucleic acid probe, such 
that upon iQ^briidization of said first and second nucleic acid probes, a binding site for the 
bactericphage laAibda CI repressor protein, ±e bacteriophage lambda cro protein or a derivative or 
homology thereof, is formed. 



35. Th5 diagnostic or forensic test kit of claim 34 w herein said second nucldc acid binding 
protein is the bi icteriophage lambda CI repressor protein, the bacteriophage lambda cro protein or 
a derivative or homology thereof 
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y 

36. AconzpositioiL 



L ccn5>rising HIV-Lock or a recombinant vector encoding HIV-Lock and 
a phannaceutically acceptJ ible carrier. 

37. A mrthod oil differentially binding a nucleic acid bmding protein to a nucleic acid 
sequence correlated with a pathogenic condition which comprises: 

(a) selecting a particular configuration of nucleic acid binding protein sequ^xces 
present in^ the nucleic acid sequence correlated with a pathogenic condition as a 
targ^ sequence for designing a probe nucleic acid which will hybridize to that 
particular iconfiguration of nucleic acid seqiiences if present in a test sample, and 
fiirther, cnkring that a binding site for an available nucleic acid binding protein is 
fonned i^n hybridization of said probe nucleic acid and said particular 
configurai ion of nucleic acid sequences chosen as a target; 

(b) selecting a nucleic acid binding protein which specifically binds to fiie selected 
particular configuration of nucleic acid binding protein sequences correlated with 
a pathogc nic condition, but which does not bind to sequences not correlated with 
said pathogenic condition; 

(c) hjfaidiimg said probe nudeic acid with a test sample suspected of containing said 
particular configuration of nucleic acid binding protein sequences present in nucleic 
arid sequences cofrelated with a pathogenic condition; 

(d) contactinjg said nucleic acid binding protein with any hybrids fcHined in step (b); 
and 

(e) detemjd any binding of said nuclric acid binding protein with said hybrids. 



1 
2 
3 



38. The r^ethodjof claim 37 wherein said particular configuration of nucleic acid bindmg 
protein sequenc^ '^^.^^k^ firom a necessary step or control point in the development of i 
pathogenic condi^on. 



V 



\ 

! \ 



39. The methii of daim 9 wherein said method is carried out in an automated fashion. 



1 40. The methcjd of daim 3 9whercin the method is carried out in the Abbott Laboratories 

2 IMx machine. 



1 



41. The meti^od of claim 9 carried out in a microtiter plate. 
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42. A method of amplifying the signal obtained dirough binding the PNA of claiml to a 
TNA flMch cwnprises binding BNAs to the PNA-TNA Itybrid and binding labefcd BBAs to the 
BNAs. \ 

I 

of asseoibling a nucleic acid binding con^lex which CQnq)rises using 
asymnBtiy seqoE^ to direct flie association or mm-associatioa of components of the nudeic acid 
binding compl( 




4C A method of assemb 
assembly sequences derived &om 
con^xBients of the nucleic acid b: 




nucleic acid binding complex whidi c(»iq)rises using 
iophage lambda cro cx CI to assonble associated 
iplex. 



J 
45. A 



muidmodcpiotdn 
protdn ccanplex to s; 
complex. 

/ 

46. A 





of using assembly, asynmietry, or piloting secpiences to assemble a 
coopris^ linking subunits to be incorporated into the multimaric 
Jy, asymmetry, or piloting sequence, and recovering said multimOTC 



dtion comprising SEQ ID NO. 105, SEQ ID NO. 106, or SEQ ID NO. 108. 



47. A nucleictacid encoding the TBA or BBA of claim 12. 



48, The TBA of claim 12 or a nucleic acid encoding said TBA, wherein the amino acid 



selected from the group consisting of set A, set B and set C, \^*erein said 
lows: 

[ce^ from Groups 




sequence of said TBA is 
sets are comprised as fj 
Se| yi^ 
A I + 

B 



wherein groups I- V cAns ist of sequences selected from: 
Groi?) Selected fix)m Sequences 

I Aiiyof!|EQIDNOS.85-92 

II MetSer,|inkedtoanyofSEQIDNOS. 104- 106, each of which is linked to SEQ 
ID NO. I 
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m 



IV 
V 



SEQIDNO. 100 
linked to dtiber^l 



/ 



SEQn5N0.74 
AnyofSEQID'^Ol 
SEQ ID NO. 99. 




to ai^ of SEQ IDNOS, 75-84 or 94-98; SEQ JD NO. 10 1 
NO. 74 of SEQ ID NO, 93; or SEQ ID NO. 102 linked to 
ID NO. 93; or any of SEQ ID NOS. 72, 103, 73, or 63-71 
04-108 



49. A method of assembling multimeric TBAs irt vivo or in situ \rfiidi comprises 
introducing component TBAs int ) a cell utilizing a covalently or non-covalcntly attadied protein or 
bi-layer vesicle or by introducing nucleic adds encoding component TBAs into a cell, said 
conD^onent TBAs each comprising a DNA recognition unit, assembly sequ^ces, asymmetry 
sequences, nuclear localization signal sequences, and optional linker sequences, such that \xpon 
proximal binding via the DNA i ecog^iition unit of each component TBA to nucleic acid sequences 
encountered in the nucleus or el sevAere in the cell, component expressed TBAs assemble via said 
assembly and asymmetry sequeaices into multimeric TBAs. 

50. A method for id^ti^^g nucleic acid binding molecules for preparation of a target 
binding assembb^ or a booster bindi ig assembly comprising: 

a. Obtaining a san^k containing the target nucleic add; 

b. Fragmenting the s^ple so as to expose the nucleic acids and to reduce the size 
complexity of the n|ucleic adds contained in the sample; 

c. Contacting a first jaliquot of the fragm^ited nucleic acids \vith a control buffer 
medium and contacting a second aliquot of the fragmented nucbic acids with the 
control buffer m^um containing a known profile of nucldc acid binding 
moleodes; 

d. Analyzing tfa^itw^^ liquots to identify fragments vMch have altered bdia vior in the 
aliquot contacted Wi^ the target binding molecules as opposed to the aliquot 
contacted ^viti^^! opntrol buffer medium; 



e. 



Identifyinj 




tg fragments which do exhibit altered behavior when 



contacted with the rmdeic acid binding molecule and either sequencing the nucleic 
add fragment to d^ermine whether known nuddc ^d binding molecule motifs are 
present, or diiecd^ identifying the nucleic arid binding molecule bound to the 

nucleic acid; and I 

I 
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coiiq>nsmg: 
a. 



c. 



[ 




I 



synthesiang T B As oon^jrising the nucleic acid binding molecides which produced 
the altered :)ehavior using assembly, asymmetry nuclear localization and, 
optionally, linker sequences. 

/ I . 

51. A method for idaitifymg specific nucleic acid sequences in a sanq>le 

Fragmenting me nucleic adds in said sample to ©q)ose the nucleic acids and reduce 
the size complexity of the nucleic acids; 

Cottoting ^TBA with the san5>le, said TBA comprising two or more nucleic acid 
binding congjcaaits each of A^iiich has a relatively weak binding for its nucleic acid 
recognition Imit w&m thet TBR but which in combination provides strong binding 
for tte^plete TBR; and 

R1imrnqtfn|r ai^ "cToss-taDc" produced by binding of the TBA to cousin nucleic 
aci</s that CQixtain individual recognition units, which comprises contacting the 
f with ekcess nucleic add binding components with relatively strong binding 
afSd^ for cousin nucleic acids that contain the individual recognition units but 
relatively Weak binding relative to the TBA*s affinity fcs- binding to the complete 
TBR having said two or more nucleic acid binding components. 
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El wai; filed 7DKC199S . Serial No. PCT/US95/15944 

I hereby state that I have reviewed and understand the contents of the above-identified specification, including the 
claims, as amended by any amendment referred to above. 

' ^ V I acknowledge the duty to disclose information which is material to the examination of this application in accordance 
with Title 37, Code of Federal Regulations, § 1 .56(a). 

I herebyclaimforeignprioritybenefltsunderTitie35, United States Code 119 of any foreign application(s) for patent 
^ or inventor'scertificatelistedbelowandhave also identifiedany foreign application for patent or inventor's certificate 
I having a filing date before that of the application on which priority is claimed: 

I Application Country Filing Date Priority Claimed 

I Serial No. 

I PCT/US95/15944 PCT 7 DEC 1995 Yes 

I hereby claim the benefit under Title 35, United States Code, § 120 of any United States application(s) listed beiow and , 
^1 insofar as the subject matter of each of the claims of this application is not disclosed in the prior United States 
l appUcation(s)in the manner provided by the first paragraph of Title 35, United States Code, § 1 12, 1 acknowledge the 
1 duty to disclose material information as defmed m Title 37, Code of Federal Regulations, § 1 .56(a) which occurred 
t between the filing date of the prior application and the national or PCT international filing date of this application: 



Application Filing Date Status (Patented, 

Serial No. Pending, Abandoned) 



08/353,476 December 9, 1994 Pending 

I hereby declare that aU statements made herein of my own knowledge are true and that all statements made on 
information and belief are believed to be true; and further that these statements were made with the knowledge that 
willful false statements and the like so made are punishable by fine or imprisonment, or both, under Section 1001 of 
Title 1 8 of the United States Code, and that such willful false statements may jeopardize iht validity of the application 
or any patent issued thereon, 

I hereby appoint the following persons registered to practice before the Patent and Trademark Office as my attorneys 
with full power of substimtionand revocation to prosecute this application and all divisions and continuations thereof 
and to transact all business in the Patent said Trademark Office connected therewith: Roman Saliwanchik, Reg. No. 
2L22?; David R. Saliwanchik, Reg. No. 31 ,79 4; JefP Lloyd, Reg. No. ^,589; Doran R. Pace, Reg. Ko.J8,261; 
Christine Q. McLeod, Reg. No. 36,2 13 ; Jay M. Sanders, Reg. No. 39,355^ Jeaii Kyle, Reg. No. 36,987; James S. Parker , 
Reg. No. 40,119, ' * 

I request that all correspondence be sent to: 

David R. Saliwanchik _ 
2421Ji:ar..41 Rt, StTset>..^MA:J^ 
Game syille, FL 32606-6 669_ 

I further request that all telephone communications be directed to: 

David R. Saliwanchik 
352-375-8100 
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^ ^ Docket No. GP-lOOCl 
Name of First or Sole Itiventor NStaan Weffiiaeer 

Residence Seattig. Washington VX^ fi^ Citizenship United States 

Post Office Address 300 Queen Anne Avenue. North. Suite 392 

Seattle. Washington 98109 USA 

Date ^M^^**^ /^/ 

SigEJMure of First or Sole Inv^tor 



Name of Second Inventor Ar t h^^^ M ^^^^^^ g^^ 

Residence Seattle. Washington „ Citizenship United States 

Post Office Address 300 Queen Aime Avenue, North, Suite 392 

Seattle, Washin gton 98109 USA 



Date 

Signature of Second Joint Invented J 



Name of Third Joint Inventor , 



Residence Citizenship. 

Post Office Address 



Date 



Signature of Third Joint Inventor 

Name of Fourth Joint Inventor 

Residence Citizenship 

Post Office Address 



\ Date 

Signature of Fourth Joint Inventor 

***********^5}!:(!^!*^^*5i:Ht ************** 



Page Two of Two Pages 



